Transcripts encoding immunomodulatory polypeptides

ABSTRACT

Substantially-isolated polynucleotides encoding human polypeptides having immunomodulatory activity; human homologs of yeast RAD50, Drosophila Septin-2 and rat Acyl-CoA Synthetase compositions and methods; method for detecting the presence of activated T-cells.

[0001] This application is a divisional of co-owned U.S. patentapplication Ser. No. 08/592,126, filed Jan. 26, 1996, which isincorporated herein by reference.

FIELD OF THE INVENTION

[0002] The present invention relates to immunomodulatory compositionsand methods.

REFERENCES

[0003] Alani, E., et al., Genetics 122:47-57 (1989).

[0004] Ausubel, F. M., et al., in CURRENT PROTOCOLS IN MOLECULARBIOLOGY, John Wiley and Sons, Inc., Media, Pa. (1988).

[0005] Balasubramanian, M. K., et al., J. Cell. Biol. 125:1289 (1994).

[0006] Beames, et al., Biotechniques 11:378 (1991).

[0007] Bellanne-Chantelot, C., et al., Cell 70:1059-1068 (1992).

[0008] Boyum, A., Scan J. Lab Invest 21:77 (1968).

[0009] Burke, D. T., et al., Science 236:806-812 (1987).

[0010] Chenchik, A., et al., Clontechniques X(1):5-8 (1995).

[0011] Chomczynski, P., and Sacchi, N., Anal. Biochem. 162:156 (1987).

[0012] Chumakov, I., et al., Nature 359:380-387 (1992).

[0013] Fujino, T. and Yamamoto, T., J. Biochem. 111:197-203 (1992).

[0014] Glisin, V., et al., Biochemistry 13:2633 (1974).

[0015] Georgopoulos, K., et al. EMBO J. 9:109-115 (1990).

[0016] Harlow, E., et al., in ANTIBODIES: A LABORATORY MANUAL, ColdSpring Harbor Laboratory Press (1988).

[0017] Innis, M. A., et al., in PCR PROTOCOLS, Academic Press (1990).

[0018] Jakobsen, K. S., et al., Nucleic Acids Res. 18:3669 (1990) .

[0019] Jakobsen, K. S., et al., “Direct mRNA Isolation Using MagneticOligo(dT) Beads: A Protocol for All Types of Cell Cultures, Animal andPlant Tissues” in ADVANCES IN BIOMAGNETIC SEPARATION, M. Uhlen, et al.,Eds., Eaton Publishing (1994).

[0020] Lewis, D. B., et al., Proc. Natl. Acad. Sci. USA 85:9743 (1988).

[0021] Longmire, J. L., et al., GATA 10:69-76 (1993).

[0022] Morgan, J. G., et al., Nucleic Acids Res. 20:5173-5179 (1992).

[0023] Mullis, K. B., U.S. Pat. No. 4,683,202, issued Jul. 28, 1987.

[0024] Mullis, K. B., et al., U.S. Pat. No. 4,683,195, issued Jul. 28,1987.

[0025] Piatak, M., et al., AIDS 7 (supp 2):S65-71 (1993).

[0026] Raymond, W. E. and Kleckner, N., Mol. Gen. Genet. 238:390-400(1993).

[0027] Reilly, P. R., et al., BACULOVIRUS EXPRESSION VECTORS: ALABORATORY MANUAL, 1992.

[0028] Sambrook, J., et al., in MOLECULAR CLONING: A LABORATORY MANUAL,Second Edition, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.(1989).

[0029] Siebert, P. D., et al., Nuc. Acids Res. 23(6):1087-1088 (1995).

[0030] Thorpe, R., et al., Blood Rev. 6:133-148 (1992).

[0031] Vandevyver, C., et al., Genome Res. 5:195-201 (1995).

[0032] Wadhwa, M., et al., in CYTOKINES: A PRACTICAL APPROACH, Balkwill,F. R., Ed., IRL Press, Oxford, 309-330 (1992).

[0033] Warrington, J. A., et al., Genomics 13:803-808 (1992).

BACKGROUND OF THE INVENTION

[0034] Cytokines and related immunomodulatory compounds play animportant role in the regulation and function of the immune system,making them suitable targets for therapeutic intervention in diseasesinvolving immune system dysfunction. It would therefore be desirable toidentify heretofore undiscovered genes encoding cytokines and otherimmunomodulatory compounds, which may be useful as a basis for treatmentof diseases affecting or influenced by the immune system. Presentmethods for the identification of such genes have met with limitedsuccess. These methods include (i) screening for DNAse I hypersensitivesites and HTF islands as potential markers for transcription units, (ii)cross-species hybridization analysis of genomic sequences, (iii)hybridization of radiolabelled cDNAs to arrayed genomic clones, (iv)screening of cDNA libraries with complex genomic probes, (v) exontrapping, (vi) random sequencing and assignment of tissue-specificcDNAs, (vii) “software trapping” of the genes in extensive genomicsequencing projects, and (viii) cDNA normalization, subtraction or/andhybridization selection using extensive genomic fragments.

[0035] Most of the above approaches have proven either unreliable, orhave required a substantial effort to find the genes of interest. Forinstance, a conventional “functional” gene cloning route includespurifying the protein factor with a particular biological activity,microsequencing the protein to design a redundant oligoprobe, raisingantibodies to the protein, expression cloning of the candidate gene orconventional screening of cDNA libraries with the redundant probe.

[0036] En masse cDNA sequencing efforts have contributed substantiallyto novel gene discovery by identifying a large number of novel sequencesand tissue expression “profiles”. However, because these effortstypically had no defined targets and depended on screening conventionalcDNA libraries, they resulted in the preferential identification ofcommon, abundant cDNAs, and were thus biased against the identificationof novel cytokine genes, which tend to be selectively expressed atrelatively low levels.

[0037] Exon trapping can be efficiently used to screen complex genomicDNA. This method is widely-used due to its independence of the geneexpression in any particular cell line or tissue, but it requiressubstantial further efforts for isolation and identification of thegenes in question.

[0038] Many of the difficulties in cytokine gene identificationmentioned above have been overcome by employing methods detailed in thepresent specification. These methods were used to isolate a number ofhuman cDNA fragments which may encode immunomodulatory molecules.

SUMMARY OF THE INVENTION

[0039] In one aspect, the present invention includes asubstantially-isolated polynucleotide having a sequence encoding a humanpolypeptide having immunomodulatory activity. In one embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:65. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:66. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:67. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:68. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:70. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:71. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:72. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:73. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:74. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:76. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:78. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:79. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:82. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:83. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:85. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:86. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:88. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:92. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:95. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:98. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:99. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:100. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:104. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:105. In another embodiment,the polynucleotide has the sequence represented as SEQ ID NO:106. Inanother embodiment, the polynucleotide has the sequence represented asSEQ ID NO:107. In another embodiment, the polynucleotide has thesequence represented as SEQ ID NO:108. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:109. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:112. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:113. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:114. In another embodiment,the polynucleotide has the sequence represented as SEQ ID NO:115. Inanother embodiment, the polynucleotide has the sequence represented asSEQ ID NO:124. In another embodiment, the polynucleotide has thesequence represented as SEQ ID NO:130. In another embodiment, thepolynucleotide has the sequence represented as SEQ ID NO:132. In anotherembodiment, the polynucleotide has the sequence represented as SEQ IDNO:133. In another embodiment, the polynucleotide has the sequencerepresented as SEQ ID NO:134. In another embodiment, the polynucleotidehas the sequence represented as SEQ ID NO:135. In another embodiment,the polynucleotide has the sequence represented as SEQ ID NO:136. Inanother embodiment, the polynucleotide has the sequence represented asSEQ ID NO:137. In yet another embodiment, the polynucleotide has thesequence represented as SEQ ID NO:138.

[0040] In a preferred embodiment, the polynucleotide contains a sequenceselected from the group represented by SEQ ID NO:65, SEQ ID NO:66, SEQID NO:67, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ IDNO:73, SEQ ID NO:74, SEQ ID NO:76, SEQ ID NO:85, SEQ ID NO:86, SEQ IDNO:92, SEQ ID NO:95, SEQ ID NO:99, SEQ ID NO:105, SEQ ID NO:106, SEQ IDNO:107, SEQ ID NO:124, SEQ ID NO:130, SEQ ID NO:135, SEQ ID NO:136, SEQID NO:137 and SEQ ID NO:138. In another preferred embodiment, thepolynucleotide contains a sequence selected from the group representedby SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:70, SEQ ID NO:71,SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76,SEQ ID NO:78 and SEQ ID NO:79.

[0041] In another aspect, the present invention includes asubstantially-isolated human polypeptide having immunomodulatoryactivity, where the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:65. In anotherembodiment, the polypeptide has a sequence encoded by a polynucleotidehaving a sequence represented by SEQ ID NO:66. In another embodiment,the polypeptide has a sequence encoded by a polynucleotide having asequence represented by SEQ ID NO:67. In another embodiment, thepolypeptide has a sequence encoded by a polynucleotide having a sequencerepresented by SEQ ID NO:68. In another embodiment, the polypeptide hasa sequence encoded by a polynucleotide having a sequence represented bySEQ ID NO:70. In another embodiment, the polypeptide has a sequenceencoded by a polynucleotide having a sequence represented by SEQ IDNO:71. In another embodiment, the polypeptide has a sequence encoded bya polynucleotide having a sequence represented by SEQ ID NO:72. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:73. In anotherembodiment, the polypeptide has a sequence encoded by a polynucleotidehaving a sequence represented by SEQ ID NO:74. In another embodiment,the polypeptide has a sequence encoded by a polynucleotide having asequence represented by SEQ ID NO:76. In another embodiment, thepolypeptide has a sequence encoded by a polynucleotide having a sequencerepresented by SEQ ID NO:78. In another embodiment, the polypeptide hasa sequence encoded by a polynucleotide having a sequence represented bySEQ ID NO:79. In another embodiment, the polypeptide has a sequenceencoded by a polynucleotide having a sequence represented by SEQ IDNO:82. In another embodiment, the polypeptide has a sequence encoded bya polynucleotide having a sequence represented by SEQ ID NO:83. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:85. In anotherembodiment, the polypeptide has a sequence encoded by a polynucleotidehaving a sequence represented by SEQ ID NO:86. In another embodiment,the polypeptide has a sequence encoded by a polynucleotide having asequence represented by SEQ ID NO:88. In another embodiment, thepolypeptide has a sequence encoded by a polynucleotide having a sequencerepresented by SEQ ID NO:92. In another embodiment, the polypeptide hasa sequence encoded by a polynucleotide having a sequence represented bySEQ ID NO:95. In another embodiment, the polypeptide has a sequenceencoded by a polynucleotide having a sequence represented by SEQ IDNO:98. In another embodiment, the polypeptide has a sequence encoded bya polynucleotide having a sequence represented by SEQ ID NO:99. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:100. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:104. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:105. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:106. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:107. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:108. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:109. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:112. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:113. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:114. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:115. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:124. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:130. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:132. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:133. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:134. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:135. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:136. Inanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:137. In yetanother embodiment, the polypeptide has a sequence encoded by apolynucleotide having a sequence represented by SEQ ID NO:138.

[0042] In a preferred embodiment, the polypeptide has a sequence encodedby a polynucleotide selected from the group consisting of SEQ ID NO:65,SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:71,SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:76, SEQ ID NO:85,SEQ ID NO:86, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:99, SEQ ID NO:105,SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:124, SEQ ID NO:130, SEQ IDNO:135, SEQ ID NO:136, SEQ ID NO:137 and SEQ ID NO:138. In anotherpreferred embodiment, the polypeptide has a sequence encoded by apolynucleotide selected from the group consisting of SEQ ID NO:65, SEQID NO:66, SEQ ID NO:67, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ IDNO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:78 and SEQ IDNO:79.

[0043] In another aspect, the present invention includes asubstantially-isolated polynucleotide having a sequence encoding a humanhomologue of yeast RAD50. In one embodiment, the polypeptide contains apolypeptide sequence encoded by a polynucleotide sequence selected fromthe group consisting of SEQ ID NO:54 and SEQ ID NO:55.

[0044] In a related aspect, the invention includes asubstantially-isolated human homolog of yeast RAD50 polypeptide. In oneembodiment, the homolog polypeptide contains a polypeptide sequenceencoded by a polynucleotide sequence selected from the group consistingof SEQ ID NO:54 and SEQ ID NO:55.

[0045] Yet another aspect of the present invention includes asubstantially-isolated polynucleotide having a sequence encoding a humanhomologue of Drosophila melanogaster Septin-2. In one embodiment, thepolypeptide contains a polypeptide sequence encoded by thepolynucleotide sequence represented by SEQ ID NO:97.

[0046] In a related aspect, the invention includes asubstantially-isolated human Septin-2 homolog polypeptide. In oneembodiment, the homolog polypeptide contains a polypeptide sequenceencoded by the polynucleotide sequence represented by SEQ ID NO:97.

[0047] Still another aspect of the present invention includes a methodof identifying the presence of activated T-cells in a sample containinga plurality of different cell types. The method includes performing apolymerase chain reaction amplification, where an aliquot of the sample(or homogenate/fraction thereof) serves as an amplification target andwhere the amplification is done using an oligonucleotide primer paircapable of selective amplification of a polynucleotide fragment havingthe sequence represented as SEQ ID NO:151. The amplification reactiongenerates an amplification product having a specific size, and the sizeof the amplification product is determined. The presence ofamplification product of an expected size is indicative of the presenceof activated T cells in the sample. In one embodiment, theoligonucleotide primer pair consists of primers having sequencesrepresented as SEQ ID NO:149 and SEQ ID NO:150. In another embodiment,the sample is derived from adult tissue.

[0048] The invention also encompasses a method of identifying sequencesencoding polypeptides having immunomodulatory activity. The methodincludes (i) selecting, by direct selection using sequences specific forregion 5q23-31 of human chromosome 5, cDNA fragments isolated fromtissues or cells expressing cytokines, (ii) grouping the fragments into“bins”, where each bin represents cDNA fragments corresponding to asingle gene or genetic locus, the grouping performed by sequencing thefragments and/or mapping the fragments to longer sequences derived fromregion 5q23-31 of human chromosome 5, and (iii) analyzing the tissuespecificity of expression of transcripts corresponding to the fragments(transcripts from the gene or locus which the fragments represent). Inone embodiment, the first step (step (i)) is performed using cDNAsobtained from cell lines and/or tissues expressing cytokines, such asactivated T-cells. In another embodiment, the first step is performedusing cDNAs obtained from a chromosome 5-specific activated T-cell cDNAlibrary in lambda gt10, which was constructed using a kit from LifeTechnologies, Inc. and is deposited at Genelabs Technologies, Inc.,Redwood City. In another general embodiment, the analyzing oftissue-specific expression is carried out using sequence-specificprimers in a polymerase chain reaction amplification reaction containingtarget nucleic acids derived from tissues or cell lines of interest.Examples of tissues which may be used in determining the tissuespecificity of expression include total embryo, fetal liver, fetalbrain, fetal muscle, placenta, adult heart, adult muscle, adult liver,adult brain, adult pancreas, adult kidney, adult aorta, adult spleen,adult testis, adult bone marrow, resting T-cells and activated T-cells.

[0049] The present invention also includes a method of obtainingfull-length sequences of genes or loci identified as havingimmunomodulatory activity. The method includes selecting a desiredsequence identified in Table 1 and using the sequence to isolateoverlapping clones. In one embodiment, such overlapping clones areisolated using rapid amplification of cDNA ends (RACE) PCR with cDNAobtained from tissues or cell lines of interest or from a cDNA orgenomic DNA library. In another embodiment, the overlapping clones areisolated by direct hybridization screening of a cDNA or genomic DNAlibrary made from, for example, T-cells, a lymphoma or a leukemia.

[0050] Also included in the invention is a method of identifyingproteins having immunomodulatory activity. The method includes obtaininga full-length coding sequence of a gene represented by a sequencepresented in Table 1 (e.g., as described above) and cloning the sequenceinto a recombinant expression vector. The resulting vector is then usedto express recombinant polypeptides in selected host cells, such as E.coli.

[0051] The invention also includes a method of identifying smallmolecules that affect alter and/or modulate the activity ofimmunomodulatory proteins such as described above. The method includesassaying the effects of a polypeptide having immunomodulatory activityin the presence and absence of a test small molecule compound, andidentifying the test compound as effective if the test compound iseffective to significantly alter the effects of the polypeptide. In oneembodiment, the small molecule compound is one of a plurality of suchcompounds present in a combinatorial library, such as one of a pluralityof small molecules in a small molecule combinatorial library, or one ofa plurality of peptides in a peptide combinatorial library.

[0052] These and other objects and features of the invention will bemore fully appreciated when the following detailed description of theinvention is read in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE FIGURES

[0053]FIG. 1 shows the location of primers SEQ ID NO:149 and SEQ IDNO:150 relative to sequence SEQ ID NO:151.

[0054]FIG. 2 is an image of an ethidium bromide-stained agarose gel,showing the expression pattern of SEQ ID NO:151.

DETAILED DESCRIPTION OF THE INVENTION I. Definitions

[0055] “Substantially isolated” when used with respect to polynucleotideor polypeptides refers to the at least partial purification of suchpolynucleotides or polypeptides away from unrelated or contaminatingcomponents (e.g., cellular components other than the specifiedpolynucleotide or polypeptide, and polypeptides or polynucleotideshaving a sequence different from that of the selected polypeptide orpolynucleotide. Methods and procedures for the isolation or purificationof compounds or components of interest are described below (e.g.,recombinant production of polypeptides having immunomodulatoryactivity).

[0056] Compounds or polypeptides having “immunomodulatory activity” arecompounds or polypeptides that affect the regulation or function of theimmune system. Examples of compounds or polypeptides havingimmunomodulatory activity include but are not limited to cytokines,which include growth factors, colony- stimulating factors, interleukins,lymphokines, monokines, interferons, chemokines and the like. Suchpolypeptides are typically secretory regulatory proteins that controlthe survival, growth, differentiation and effector function of tissuesor cells. Polypeptides having immunomodulatory activity also includereceptors for immunomodulatory compounds or polypeptides, including butnot limited to, cytokine receptors, which include interleukin receptors,growth factor receptors, interferon receptors and receptors for otherfactors. Other examples of immunomodulatory compounds or polypeptidesinclude transcription regulatory factors and signal transductiontransmitters, such as NF-kB, interleukin regulatory factor 1 (IRF1),interleukin regulatory factor 2 (IRF2), G-proteins, signal transducersand activators of transcription (STATs), cell division control proteins,proteins involved in DNA repair and recombination, etc. that areexpressed in human stromal or immune cells or tissues.

[0057] “Adult tissue” refers to tissue isolated from individuals olderthan about 1 year of age.

II. Cytokine Gene Cluster on Chromosome 5

[0058] Gene families tend to evolve by a process of tandemization,divergence, and in some cases, transposition. Linked families of genesare usually assumed to be together because they evolved from a commonancestor rather than being locked into a functional unit within achromosomal region. There are numerous examples of linked genes thatshow strong homology to each other (e.g. HLA) but their are also manyexamples of genes that are strongly homologous but are scatteredthroughout the genome (e.g. tubulin genes).

[0059] Cytokine genes differ from these cases because they typically donot show strong homology at the nucleic acid sequence level, and shouldnot necessarily be clustered in chromosomal regions. It is has beenrecognized herein, however, that there exist at least nine cytokinegenes and at least ten receptor genes on the long arm of humanchromosome 5 (e.g., Warrington, et al., 1992), suggesting thatfunctionally-related molecules having little or no sequence homology maybe situated together in a defined region of a chromosome.

III. Direct Selection and Analysis of Chromosome 5—Specific cDNASequences

[0060] Experiments performed in support of the present invention detailthe generation of cDNA samples enriched for sequences from the 5q23-31region of human chromosome 5. This region has been identified ascontaining a cluster of cytokine genes, including IL13, IL4, IL5, IRF1,IL3 and GM-CSF. Such immunomodulatory molecules may be involved in thedevelopment of certain cancers and immunodeficiencies, making themsuitable targets for anti-cancer and immunotherapeutic drug candidates.The cDNA samples were derived from a variety of tissues, including humanfetal brain and liver, adult bone marrow, leukemias, lymphomas,activated lymphocytes and cytokine-producing clones, as detailed inExample 1A. The samples were assayed for the presence of known cytokinesas detailed in Example 1B using primers shown in Table 2. Results ofthese assays are shown in Table 3. Those samples showing increasedexpression of cytokines were combined to create “cDNA pools”. Thecomposition of the different pools is detailed in Example 1.

[0061] A similar approach may be employed to obtain cDNA samplesenriched for various other selected sequences. For example, cDNAsequences upregulated during periods of increased synaptic transmissionmay be isolated from hippocampal slices following electrical stimulationof the slices. Such cDNA samples may be assayed for, e.g., cDNAsencoding other classes of selected molecules, such as protein kinases,phosphatases, neurotransmitters, hormones, and the like.

[0062] Pools containing relatively high levels of cDNAs encodingdifferent cytokines (see Example 1, Table 3) were further processedusing genomic “direct selection”, as detailed in Examples 2 and 3. Here,yeast artificial chromosome (YAC) clones containing the 5q23-31 regionof chromosome 5 were used to select cDNA that hybridized to sequences inthat region. Analysis of approximately 3,000 cDNAs selected with thegenomic region spanning 1.3 Mb of 5q23-31 revealed several hundred cDNAclones ranging from about 500 to about 800 bp in length. The sequenceswere further analyzed by mapping them to YAC clones containing fragmentsof the 5q23-31 region. About 79% of these clones were mapped to humanchromosome 5 and starting YACs either by RT-PCR or Southern blothybridization.

[0063] The data obtained from the physical mapping of the cDNAs to thestarting YACs and chromosome 5-specific cosmids were used to group thecDNAs according to their location and partial overlap with one another,resulting in over 50 groups, or “bins”, of cDNAs comprised ofoverlapping clones. Some of the selected cDNAs were also sequenced asdescribed in Example 4 to facilitate placement into the bins. Theresults of these analyses are presented in Table 1, below. TABLE 1 BinConsensus SEQ Sequence Best BlastX Expression # sequence NO. Type lengthhomology score profile 1 Rad50.seq 54 con1 ˜5.7 kb ˜35% overallActivated T- 18.seq 55 con2 homology to cells, S. cerevisiae Rad50:testis, fetal Score = 390, P = 3.8e−89 liver, heart 2 Tc1.seq 56 altmultiple ˜90% overall Activated T- Tc2.seq 57 alt isoforms: homology tothe Rat cells, Tc3.seq 58 alt ˜2.5 - Brain Long Chain testis, fetalTcA.seq 59 alt 0.6 kb Acyl-CoA Synthetase liver TcB.seq 60 alt TS.seq 61alt TS2.seq 62 alt FL.seq 63 alt FL2.seq 64 alt 3 G205a.seq 65 con1 ˜1.0kb homology to 1-PI 3- Activated T- G205b.seq 66 con2 kinase: Score =66, cells*, fetal G205c.seq 67 con3 P = 0.024, (14/29) . liver 4G221.seq 68 con ˜1.70 kb homology to Activated T- S. cerevisiae ZMS1cells*, gene: Score = 75, testis, fetal P = 0.038, (19/44); thymushomology to FGF: Score = 62, P = 0.74, (14/52) 5 G238con.seq 69 con ˜1.4kb homology to Activated T- drosophila Notch 2 cells, gene: Score = 56,testis, fetal P = 0.00058, (12/29) thymus 6 G229con.seq 70 con ˜2.76 kbNSM Activated T- cells** 7 G248.seq 71 con1 NSM Activated T- G248a.seq72 con2 cells G248b.seq 73 con3 G248c.seq 74 con4 G220a.seq 75 con5G255.seq 76 con6 8 G306.seq 77 con ˜0.65 kb homology to M. musculusModifier 3: Score = 67, P = 0.21 (12/17) 9 G256.seq 78 con ˜0.90 kbhomology to mouse Activated T- formin 4 gene: cells** Score = 71, P =1.8e−09 10 G181.seq 79 con ˜1.40 kb homology to Activated T- P.Aeroginosa cells** hypothetical 62.8K protein: Score = 73, P = 0.33(13/26) 11 G257.seq 80 con ˜0.70 kb homology to Lung, M. Sativa NADH-activated T- glutamate synthase: cells, brain, Score = 69, P = 0.33liver and (13/26) heart 12 E2.seq 81 con1 ˜0.7 kb; NSM E2: kidney,E9f.seq 82 con2 E9: activated T- E9r.seq 83 con3 ˜1.0 kb; cells, fetalG123con.seq 84 con4 ˜0.32 kb liver and muscle, bone marrow; E9:activated T-cells, fetal liver, testis, brain, kidney, small intestine13 A116con.seq 85 con ˜3.1 kb NSM Activated T- cells, placenta, fetalliver and muscle, kidney, heart, bone marrow 14 A25con.seq 86 con ˜1.9kb NSM Activated T- cells, fetal liver and muscle, placenta, kidney,bone marrow 15 A46.seq 87 con ˜0.85 kb NSM fetal liver, kidney andmuscle, placenta, activated T- cells, heart, bone marrow 16 A66.seq 88con ˜0.68 kb NSM activated T- cells, fetal liver, placenta, heart, bonemarrow 17 A42.seq 89 con ˜0.57 kb homology to rabbit Activated T- T-cellreceptor, cells, fetal beta chain: muscle, Score = 59, P = 7.9-0.6placenta, (21/60) heart, kidney 18 A76conseq 90 con ˜1.044 bp homologyto ATP Activated T- synthase: Score = 67, cells, P = 5.4-0.6 (21/60)placenta, and ubiquinone: heart Score = 63, P = 8.8-0.6 (11/28) 19E105con.seq 91 con ˜1.7 kb homology to 3 ESTs 20 G180con.seq 92 con˜0.93 kb 21 G310con.seq 93 con1 ˜2.10 kb NSM G310: lung, G326con.seq 94con2 ˜1.38 kb liver, brain, G164con.seq 95 con3 ˜1.10 kb smallintestine, testis, activated T- cells; G326: lung, liver, brain, thymus,activated T- cells; G164: lung, liver, brain, thymus, ovary, activatedT- cells; 22 G65.seq 96 con ˜2.1 kb ˜60% aa homology to human and chickpropyl 4- hydroxylase, procollagen-proline dioxygenase, gamma-butyrobetaine, 2- oxoglutarate dioxygenase: Score = 797, P = 5.1e−164 23CDC3.seq 97 con ˜1.2 kb homology to (partial Drosophila Septin-sequence) 2; ˜50% aa homology to human and yeast cell division controlproteins, such as CDC3 and CDC10: Score = 196, P = 2.9e−63 24 G42con.seq98 con ˜0.5 kb NSM 25 G105con.seq 99 con ˜0.64 kb NSM Activated T-cells, fetal liver, kidney, lung, small intestine, heart, brain, spleenand testis 26 G98con.seq 100 con ˜0.41 kb NSM 27 G73con.seq 101 con ˜0.4kb ˜76% homology to Ubiquitous human ubiquinol cytochrome C reductase:Score = 338, P = 6.3e−45 28 G89con.seq 102 con ˜2.2 kb homology toProstate, (partial X. laevis apical brain, sequence) plasma membranekidney, protein: Score = 149, liver, small P = 2.7e−35 intestine,placenta 29 G102.seq 103 con ˜0.4 kb homology to human ataxin-1 gene:Score = 67, P = 0.2 (13/24) 30 G57.seq 104 con ˜0.4 kb NSM 31 G108.seq105 con ˜0.25 kb NSM Activated T- cells, small intestine 32 G127.seq 106con ˜0.42 kb NSM Activated T- cells, fetal liver, heart, kidney, brain,spleen, placenta, testis, small intestine 33 G86.seq 107 con ˜028 kb NSMActivated T- cells, brain 34 G78.seq 108 con1 ˜0.5 kb NSM H993.seq 109con2 35 G38a.seq 110 con ˜0.5 kb NSM 36 H90.seq 111 con NSM 37 G66.seq112 con1 NSM H973.seq 113 con2 38 H505.seq 114 con1 NSM H989.seq 115con2 39 E118con.seq 116 con ˜0.82 kb ˜95% homology to Activated T-several ESTs in cells, fetal Genbank muscle and liver, heart, kidney,brain muscle, aorta, placenta 40 E69f.seq 117 con1 ˜0.9 kb 100% homologyto E69r.seq 118 con2 human eF-1 alpha gene 41 E36.seq 119 con NSM 42A104f.seq 120 con1 ˜0.8 kb 100% homology to A104r.seq 121 con2 humanserine protease B gene 43 H622.seq 122 con ˜0.59 kb homology to humangamma-G globin gene: Score 348, P = 7.2-45, (67/70) 44 G61con.seq 123con ˜0.7 kb NSM 45 G45.seq 124 con ˜0.29 kb Lung, kidney, brain, thymus,fetal liver and brain, activated T- cells 46 G3con.seq 125 con ˜1.26 kbNSM 1a G30.seq 126 con ˜0.32 kb NSM Lung, brain, kidney, heart, muscle,liver, placenta, small intestine, activated T- cells 2a G32.seq 127 con˜0.38 kb NSM Ubiquitous 3a G37.seq 128 con ˜0.4 kb NSM 4a G39.seq 129con ˜0.43 kb NSM Kidney, fetal liver, activated T- cells 5a G75.seq 130con ˜0.4 kb NSM Kidney, fetal liver, small intestine, activated T- cells6a H100.seq 131 con ˜0.37 kb 100% to known human H19 gene 7a H414f.seq132 con ˜0.8 kb NSM 8a H631.seq 133 con ˜0.5 kb NSM 9a G93.seq 134 con˜0.4 kb NSM 10a G115a 135 con1 ˜2.0 kb homology to Kidney, lung, G115b136 con2 (partial drosophila homeotic liver, brain, G115c.seq 137 con3sequence) Cad gene: Score = 69, heart, P = 0.12 (12/19) placenta,spleen, small intestine, testis, muscle, activated T- cells, fetal liver11a G122.seq 138 con 0.25 kb homology to human Activated T-heparin-binding cells, growth factor: spleen, fetal Score = 57, P =0.26, liver (11/26) 12a G329f.seq 139 con ˜1.00 kb 100% homology tohuman HSP70 gene 13a E67.seq 140 con ˜2.57 kb ˜90% homology to the humanubiquitin gene: Score = 460, P = 1.8e−119 14a E94.seq 141 con 0.63 kbhomology to lilium longiformium HSP70 gene: Score = 62, P = 0.48 (12/28)

[0064] ** expression profile tested using only samples containingactivated T-cells (no other tissues tested)

[0065] The bin number, name, type, SEQ ID NO: and approximate size ofeach sequence are provided in the first set of columns. The “type” ofsequence is either (i) a single contiguous consensus sequence derivedfrom the overlapping cDNA clones that comprise that bin (“con”), (ii)two or more non-overlapping consensus sequences within a bin (e.g.,“con1”, “con2” , . . . ), representing consensus sequences covering,e.g., a 3′ and a 5′ portion of a region that has not been completelysequenced, or (iii) alternatively-spliced variants (“alt”) derived fromthe same “parent” sequence.

[0066] All sequences identified in Table 1 were analyzed by subjectingthem to a “BLASTX” homology search against a protein sequence database(PIR+SWISS-PROT). The results of these analyses are also presented inTable 1. In cases where the BLASTX search did not yield a significantmatch, the cell in the Table is labeled “NSM” (no significant matches).

[0067] The last column of Table 1 presents a summary of experimentsperformed to address the expression patterns of the various cDNAs. Mostof these experiments were performed using RT-PCR with primers specificfor the consensus sequence representing each bin. The details of theexperimental methods are presented in Example 6B. Primers specific forthe sequences to be amplified were constructed using standard methods.The primers were selected such that the expected amplification productswere typically between 200 and 1000 bp in length. The following tissueswere used for the RT-PCR reactions: total embryo (6, 8, 12 weeks ofgestation), fetal liver, fetal brain, fetal muscle, placenta, adultheart, adult muscle, adult liver, adult brain, adult pancreas, adultkidney, adult aorta, adult spleen, adult testis, adult bone marrow, JYB-cell line, resting T-cells and activated T-cells. The RT-PCRexpression analyses revealed that many of the novel, previouslyuncharacterized cDNAs, were expressed in activated T-cells, suggestingthat they may encode novel immunomodulatory molecules.

[0068] Many of the consensus sequences presented above were arrived atby analysis of a number of overlapping clones. In some cases, the clonesoverlapped only near their ends and the sequence between the overlapswas derived from a single cDNA. In such cases and some others, theconsensus sequence may contain alternatively-spliced sequences from thesame gene, even though different alternatively-spliced transcriptsderived from the same region of the gene were not detected. Suchalternatively-spliced sequences in the consensus sequences may havedifferent tissue specificities, and thus give rise to different patternsof expression, depending on which portion of the cDNA is beingamplified. In cases where different patterns of expression for cDNAsthat were part of the same consensus sequence were detected, the“expression profile” in Table 1 above lists all the tissues in whichexpression of any of the cDNAs constituting the bin or consensussequence was detected, unless indicated otherwise.

[0069] One such difference in expression profiles was observed with thecDNAs comprising bin 13 (All6con.seq; SEQ ID NO:85). A series ofexperiments designed to detect expression profiles of various cDNAswhich comprised SEQ ID NO:85 yielded the results shown in Table 1.However, as described in Example 9 and illustrated in FIG. 2,experiments using primers SEQ ID NO:149 and SEQ ID NO:150, designed toamplify the indicated portion of the sequence shown in FIG. 1 (SEQ IDNO:151), consistently detected this transcript only in activatedT-cells. Accordingly, amplification of this DNA fragment may be used asa sensitive method to detect the presence of activated T-cells.

[0070] The cDNAs identified as described above may also be used toidentify the chromosome region from which they are derived, usingstandard mapping techniques, as detailed, for instance, in Example 5.Such mapping information may be used, for example, to identify cloneswhich map to regions implicated in genetic diseases or disorders.

[0071] Physical mapping of selected cDNAs can be done by a variety ofmeans. A particularly rapid method is PCR mapping to the YAC clones andnatural or radiation hybrid panels that carry whole human chromosome 5or its portions. Such experiments preferably include appropriatecontrols, such as no genomic DNA, total genomic DNA, rodent genomic DNAthat is present in radiation hybrids, etc. Another approach employsSouthern blot hybridization of the YAC clones or genomic DNA, andisolation of chromosome 5-specific cosmids or any other genomic clones,such as bacteriophage 1 (P1) or bacterial artificial chromosomes (BACs),with cDNA as a probe.

[0072] Some of the nucleotide sequences presented in Table 1 do notinclude the entire coding region of the gene to which they correspond.In such cases, the remaining sequences may be obtained by one of skillin the art using the sequence information and teachings contained in thepresent specification combined with standard molecular techniques. Forexample, the cDNA clones described herein, or fragments thereof, may beused to screen a cDNA library constructed from, e.g., human T-cellsusing standard methods (e.g., Ausubel, et al., 1988). Such libraries arecommercially available, for example, from Clontech (Palo Alto, Calif.).Full length clones may also be obtained by similar screening of cDNApools generated as described herein.

[0073] Using the sequence information disclosed herein, one of skill inthe art employing standard techniques may derive near full length cDNAs,express and purify protein products of such cDNAs, and confirm functionusing, e.g., gene knock-out experiments. This information may then beused to develop specific assay systems to test for biological activitiesand to screen for therapeutic compounds that modulate those activities.

IV. Characterization of Exemplary cDNA Sequences

[0074] Expression of specific sequences was assessed using Northern blotanalyses, as detailed in Example 6A. The Northern analyses wereperformed with cDNA fragments representing bins 1, 2, 3, 13, 16, 18, 22,23, and 28, on blots generated essentially as described in Example 6A orobtained commercially, typically from Clontech (Palo Alto, Calif.).

A. Human Acyl-CoA Synthetase

[0075] Fragments derived from the Human Acyl-CoA synthetase gene (SEQ IDNO:59) were used to assess the expression pattern of this gene usingNorthern blot in the following adult human tissues: spleen, thymus,prostate, testis, ovary, small intestine, colon and leukocytes. Theexperiments were also performed using fetal brain, fetal lung, fetalliver and fetal kidney. A prominent band was seen in all tissues atapproximately 1.8 kb. In addition, a 2.8 kB form was detected in testis,prostate, fetal liver and activated T-cells.

[0076] The expression pattern of human Acyl-CoA synthetase is differentfrom that of the rat brain Acyl-CoA synthetase, where two predominanttranscripts of 2.9 and 6.3 kb are observed predominantly in brain andheart, and to some extent, in adrenal tissue (Fujino and Yamamoto,1992). Based on these data, it is suggested that the human Long-chainacyl-CoA synthetase (LACS) gene described herein may be transcribed fromtwo different promoters and that it's alternative processing representsa ubiquitous mechanism for generation of multiple protein isoforms ortissue-specific regulation. LACS from different species have beenisolated and shown to play a critical role in fatty acid metabolism,acylation of many membrane proteins, and signal transduction.

[0077] The above results suggest that sequences derived from the humanAcyl-CoA Synthetase gene may be used as a marker for testis tissue.Further, the promoters from the human LACS gene, which may be isolatedusing standard methods (e.g., Ausubel, et al., 1988), may be used totarget expression of heterologous genes in testis tissue. Suchexpression may desirable, for example, in gene therapy approaches totesticular cancer.

B. Human RAD50 Homolog

[0078] Northern experiments were also performed using probes derivedfrom sequences (SEQ ID NO:54 and SEQ ID NO:55) from the human homologueof yeast Rad50. A non-coding 3′-flanking fragment of the genecorresponding to nucleotides 4333-5567 of SEQ ID NO:54 was used to probea Northern Blot containing RNA derived from the same set of adulttissues as described above. mRNA species of 1.9 and 0.85 kb weredetected in all tissues tested, with the strongest expression in testis,ovary and small intestine. Uniformity of RNA loading was confirmed usinga beta-actin probe.

[0079] Similar experiments employing a probe corresponding tonucleotides 417-4353 of SEQ ID NO:54 revealed two mRNA species. Astronger signal was observed at about 5.8 kb and much lower signal wasdetected at about 6.5 kb in all tissues, with strongest expression intestis. The results of RT-PCR and Northern blot analyses taken togetherconfirmed expression of the human gene in activated T-cells, B-cells,placenta and multiple fetal tissues, including fetal liver. Genomicequivalent of this gene is about 100 kb.

[0080] In yeast, Rad50 encodes major and minor transcripts of 4.2 and4.6 kb in length, respectively (Raymond and Kleckner, 1993).Steady-state levels of both transcripts increase during meiosis,reaching maximal levels midway during meiotic prophase. Yeast RAD50appears to be involved in DNA repair. It is required during vegetativegrowth for recombinational repair of double strand breaks and forefficient mating type switching, a direct recombination event promotedby a site-specific double strand break. Most S. cerevisiae mutants ofrad50 are deficient in repair of damage induced by X rays and in meioticrecombination.

[0081] The polypeptide predicted for S. cerevisiae Rad50 protein is 153kDa (1312 aa) (Alani, et al., 1989). The protein contains anamino-terminal ATP-binding domain. Inactivation of this site by pointmutations results in a null phenotype, and primary defects in meiosis.The remainder of the protein includes two long segments of heptad repeatsequence diagnostic of regions capable of forming alpha-helical coiledcoils, one of which is similar to the S-2 domain of the myosin heavychain. Since some mutations in the protein affect meiotic recombinationbut not the repair, it is likely that the protein has domains withdifferent roles.

[0082] It is contemplated that the human homologue of Rad50 describedherein (e.g, as represented by SEQ ID NO:144) plays a role in human DNArepair and may be a target for cancer related therapeutics. For example,since attenuating the function of Rad50 gene products may sensitizecells to DNA damage, they may be targets for therapeutic interventionsthat rely on DNA damage to differentially inhibit tumor survival.

C. Human Septin-2 Homolog

[0083] A fragment corresponding to nucleotides 203-1464 of SEQ ID NO:97was used to probe a Northern blot containing RNA from the same adulttissues listed above. The probe identified a faint band at 4.6 kb inlymph node, thymus, appendix, bone marrow and fetal liver. A nearfull-length cDNA containing −4.6 kb of the human gene was isolated usingMarathon RACE using primer designed based on sequence SEQ ID NO:97. ThecDNA encodes a protein that has higher homology to Septin-2 than to CDC3and that is 40 aa longer than Drosophila Septin-2.

[0084] In yeast, mutants of cell division cycle (cdc) gene 3 (cdc3) areincapable of forming an F-actin contractile ring. It is now believedthat cdc3 encodes a profilin that plays essential in cytokinesis, bycatalyzing the formation of the F-actin contractile ring(Balasubramanian, et al., 1994).

[0085] In Drosophila, Septin-2 is present at the bud neck during celldivision, and is required for cytokinesis: in pnut mutants, imaginaltissues fail to proliferate and instead develop clusters of large,multinucleate cells. Pnut interacts with a gene required for neuronalfate determination in the compound eye.

[0086] Computer analysis of the human septin protein described hereinhas identified several important motifs, such as NTP-binding site(ATP/GTP-binding loop) at N-terminus, a coiled-coil region and abipartite nuclear targeting site at C-terminus. These data suggest thatits role in signal transduction to the nucleus may be associated withcell division. The coiled-coil region may be involved in the formationof protein complexes and in chromosome condensation and disjunction inthe cell cycle.

[0087] In view of septin-2's involvement in cell proliferation, it iscontemplated that the human Septin-2 homolog, peptide represented by SEQID NO:143, may be a target for anti-cancer therapies and methods.Further, monitoring septin-2 expression by quantitative RT-PCR can beused as a diagnostic tool for measuring proliferative potential ofselected cell types.

D. Other Genes and Methods

[0088] It will be understood that human cDNA sequences isolated asdescribed herein may be characterized using any of a number of assaysknown to those of skill in the art, in addition to the expression assaysdetailed above. For example, functional assays particularly advantageousfor the characterization of immunomodulatory molecules (i.e., assayswhich may be used to further characterize the immunomodulatory activityof polypeptide compositions detailed herein) include proliferationassays (e.g., as described in Example 8), as well as assays based on thestimulation of expression of specific proteins in cell lines responsiveto the immunomodulatory molecules (e.g., cytokines) under study (e.g.,Thorpe, et al., 1992; Wadhwa, et al., 1992). Specifically, compounds orpolypeptides which inhibit, e.g., T-cell proliferation, may becharacterized as immunosuppressants, whereas compounds or polypeptideswhich stimulate, e.g., T-cell proliferation, may be identified asimmunostimulants.

[0089] In the case of polypeptides comprising receptors for, e.g., otherimmunomodulatory compounds, such as cytokines, standard methods may beemployed to express the receptors in a suitable host cell suitable foradditional experiments, such as binding assays or physiologicalexperiments.

[0090] Other methods of assaying expression may also be employed in thecharacterization of novel cDNA sequences isolated as described herein.For example, in situ hybridization may be used to perform cellularlocalization in tissues having comprised of distinct cell types. ThecDNA sequences presented herein may also be used to produce proteins(e.g., by cloning the sequences into an expression vector; Ausubel, etal., 1988). Such proteins may in turn may be employed to generateantibodies using standard methods (e.g., Harlow, et al., 1988) tolocalize the gene products at the cellular and sub-cellular levels.

V. Utility

[0091] Methods and compositions of the present invention are useful in anumber of applications. For example, they may be employed in cell typingapplications. In this aspect, the invention includes a method ofidentifying the presence of activated T-cells in a sample containing aplurality of different cell types. Experiments performed in support ofthe present invention The method includes performing a polymerase chainreaction amplification which employs an aliquot of the sample or anextract thereof as the amplification target. The reaction is conductedusing standards PCR conditions (Mullis, 1987; Mullis, et al., 1987) witholigonucleotide primers capable of selective amplification of apolynucleotide fragment having the sequence SEQ ID NO:151, to generatean amplification product having a specific size.

[0092] The selection of regions of a sequence suitable for serving astemplates for PCR primer design is well known in the art (e.g., Innis,et al., 1990). In fact computer programs designed specifically for thispurpose are commercially-available (e.g., “OLIGO” primer analysissoftware, NCBI, Inc., Plymouth, Minn.). An exemplary primer pair forsuch an amplification consists of primers having sequences SEQ ID NO:149and SEQ ID NO:150.

[0093] The size of the amplification is then determined using, forexample, agarose or polyacrylamide gel electrophoresis (see, e.g.,Ausubel, et al., 1988), and the observed size is compared with theexpected size. The detection of amplification product corresponds to theexistence of activated T-cells in the sample. The amount ofamplification product may be correlated with the number of activatedT-cells using a quantitative PCR approach (e.g., Piatak, et al., 1993;Vandevyver, et al., 1995).

[0094] The identification of activated T-cells in a sample is useful in,e.g., the diagnosis of diseases affecting activated T-cells or T-cellactivation, such as AIDS, rheumatoid arthritis, asthma, cystic fibrosis,atherosclerosis, ulcerative colitis, asthma and severe allergies.

[0095] Another utility enabled by the present disclosure is a method ofidentifying sequences encoding polypeptides having immunomodulatoryactivity. The method includes (i) selecting, by direct selection usingsequences specific for region 5q23-31 of human chromosome 5, cDNAfragments isolated from tissues or cells expressing cytokines, (ii)grouping the fragments into bins, where each bin represents cDNAfragments corresponding to a single gene or genetic locus, the groupingperformed by sequencing the fragments and/or mapping the fragments tolonger sequences derived from region 5q23-31 of human chromosome 5, and(iii) analyzing the tissue specificity of expression of transcriptscorresponding to the fragments (transcripts from the gene or locus whichthe fragments represent). In one embodiment, the first step is performedusing cDNAs obtained from cell lines and/or tissues expressingcytokines, such as activated T-cells. In another embodiment, the firststep is performed using cDNAs obtained from a chromosome 5-specificactivated T-cell cDNA library in lambda gt10, which was constructedusing a kit from Life Technologies, Inc. and is deposited at GenelabsTechnologies, Inc., Redwood City. In another embodiment, the analyzingof tissue-specific expression is carried out using sequence-specificprimers in a polymerase chain reaction amplification reaction containingtarget nucleic acids derived from tissues or cell lines of interest.Tissues which may be used in determining the tissue specificity ofexpression include total embryo, fetal liver, fetal brain, fetal muscle,placenta, adult heart, adult muscle, adult liver, adult brain, adultpancreas, adult kidney, adult aorta, adult spleen, adult testis, adultbone marrow, resting T-cells and activated T-cells.

[0096] The teachings of the present disclosure may also be employed in amethod of obtaining full-length sequences of genes or loci identified ashaving immunomodulatory activity. The method includes selecting adesired sequence identified in Table 1 and using the sequence to isolateoverlapping clones. In one embodiment, such overlapping clones areisolated using rapid amplification of cDNA ends (RACE) PCR with cDNAobtained from tissues or cell lines of interest or from a cDNA orgenomic DNA library. In another embodiment, the overlapping clones areisolated by direct hybridization screening of a cDNA or genomic DNAlibrary made from, for example, T-cells, a lymphoma or a leukemia.

[0097] As another example of a utility, the present invention includes amethod of identifying proteins having immunomodulatory activity. Themethod includes obtaining a full-length coding sequence of a generepresented by a sequence presented in Table 1 (e.g., as describedabove) and cloning the sequence into a recombinant expression vector.The resulting vector is then used to express recombinant polypeptides inselected host cells, such as E. coli. Expression vectors such asdescribed above typically contain control sequences, such as sequencescontaining promoter regions, enhancer elements, and the like, which arecompatible with the selected host cell. These control sequences areoperably linked to the insert sequence such that the insert sequence canbe expressed in the selected host cell.

[0098] One example of an expression vector for recombinant production oflatency-associated polypeptides is the plasmid pGEX (Smith, et al.,1985, 1988) and its derivatives (e.g., the pGEX series from PharmaciaBiotech, Piscataway, N.J.). These vectors express the polypeptidesequences of a cloned insert fused in-frame withglutathione-S-transferase. Recombinant pGEX plasmids can be transformedinto appropriate strains of E. coli and fusion protein production can beinduced by the addition of IPTG (isopropyl-thio galactopyranoside).Solubilized recombinant fusion protein can then be purified from celllysates of the induced cultures using glutathione agarose affinitychromatography according to standard methods (Ausubel, et al., 1988).

[0099] Alternatively, affinity chromatography may also be employed forisolating β-galactosidase fusion proteins, such as those produced bycloning latency-associated polypeptide sequences in lambda gt11. Thefused protein is isolated by passing cell lysis material over a solidsupport having surface-bound anti-β-galactosidase antibody.

[0100] Other suitable expression systems include a number of bacterialexpression vectors, such as lambda gt11 (Promega, Madison Wis.), pGEX(Smith, et al.), and pBS (Stratagene, La Jolla Calif.) vectors; yeastexpression systems, such as the Pichia expression kit from Invitrogen(San Diego, Calif.); baculovirus expression systems (Reilly, et al.;Beames, et al.; Clontech, Palo Alto Calif.); and mammalian cellexpression systems (Clontech, Palo Alto Calif.; Gibco-BRL, GaithersburgMd.).

[0101] A number of features can be engineered into the expressionvectors, such as leader sequences which promote the secretion of theexpressed sequences into culture medium. The recombinantly producedpolypeptides are typically isolated from lysed cells or culture media.

[0102] Isolated recombinant polypeptides produced as described above maybe purified by standard protein purification procedures, includingdifferential precipitation, molecular sieve chromatography, ion-exchangechromatography, isoelectric focusing, gel electrophoresis and affinitychromatography. Protein preparations can also be concentrated by, forexample, filtration (Amicon, Danvers, Mass.).

[0103] In addition to recombinant methods, latency-associated proteinsor polypeptides may be chemically synthesized using methods known tothose skilled in the art.

[0104] Polypeptides obtained as described above may be further evaluatedby methods known in the art of cytokines and interleukins. For example,the polypeptides may be tested in functional assays, such as cellproliferation assays and assays designed to monitor the activation ofgene expression in response to cytokine stimulation as described above.

[0105] It is further contemplated that polypeptides identified as havingimmunomodulatory activity may be employed in therapeutic applications toaugment, affect and/or correct the functioning of the immune system in asubject in need of such treatment.

[0106] In another example of the utility of the present invention, theteachings herein may applied in a method of identifying small moleculesthat affect alter and/or modulate the activity of immunomodulatoryproteins such as described above. The method includes assaying theeffects of a polypeptide having immunomodulatory activity in thepresence and absence of a test small molecule compound, and identifyingthe test compound as effective if the test compound is effective tosignificantly alter the effects of the polypeptide. In one embodiment,the small molecule compound is one of a plurality of such compoundspresent in a combinatorial library, such as one of a plurality of smallmolecules in a small molecule combinatorial library, or one of aplurality of peptides in a peptide combinatorial library. Small moleculecompounds include, but are not limited to, peptides, macromolecules,small molecules, chemical and/or biological mixtures, and fungal,bacterial, or algal extracts. Such compounds, or molecules, may beeither biological, synthetic organic, or even inorganic compounds, andmay be obtained from a number of sources, including pharmaceuticalcompanies and specialty suppliers of libraries (e.g., combinatoriallibraries) of compounds.

[0107] The following examples illustrate but in no way are intended tolimit the present invention.

Materials and Methods

[0108] Unless otherwise indicated, restriction enzymes and DNA modifyingenzymes were obtained from New England Biolabs (Beverly, Mass.) orBoehringer Mannheim (Indianapolis, Ind.). Nitrocellulose paper wasobtained from Schleicher and Schuell (Keene, N.H.). Materials forSDS-polyacrylamide gel electrophoresis (SDS-PAGE) were obtained fromBio-Rad Laboratories (Hercules, Calif.). Other chemicals were purchasedfrom Sigma (St. Louis, Mo.) or United States Biochemical (Cleveland,Ohio).

A. Buffers and Media

[0109] Phosphate-Buffered Saline (PBS)

[0110] 10×stock solution, 1 liter:

[0111] 80 g NaCl

[0112] 2 g KCl

[0113] 11.5 g Na₂HPO4-7H₂O

[0114] 2 g KH₂PO₄

[0115] Working solution, pH 7.3:

[0116] 137 mM NaCl

[0117] 2.7 mM KCl

[0118] 4.3 mM Na₂HPO₄-7H₂O

[0119] 1.4 mM KH₂PO₄

[0120] SSC (Sodium Chloride/Sodium Citrate), 20x

[0121] 3 M NaCl (175 g/liter)

[0122] 0.3 M Na₃citrate-2H₂O (88 g/liter)

[0123] Adjust pH to 7.0 with 1 M HCl

[0124] SSPE (Sodium Chloride/Sodium Phosphate/Edta), 20x

[0125] 3.0 M NaCl

[0126] 0.20 M NaH₂PO₄

[0127] 20 mM EDTA, pH 7.4

[0128] Tris/EDTA Buffer (TE)

[0129] 10 mM Tris-Cl, pH as indicated

[0130] 1 mM EDTA, pH 8.0

[0131] AHC Medium and Plates (Ura⁻, trp⁻)

[0132] 1.7 g yeast nitrogen base without amino acids and withoutammonium sulfate (Difco Laboratories, Detroit, Mich.).

[0133] 5 g ammonium sulfate.

[0134] 10 g casein hydrolysate-acid, salt-free and vitamin-free (UnitedStates Biochemical, Cat. #12852, Cleveland, Ohio).

[0135] 50 ml (for medium) or 10 ml (for plates) of 2 mg/ml adeninehemisulfate (Sigma Chemical, Cat. #A-9126, St. Louis, Mo.).

[0136] Dissolve in a final volume of 900 ml H₂O, adjust pH to 5.8.

[0137] Autoclave 30 min, then add 100 ml sterile 20% (w/v) glucose. ForAHC plates, add 20 g agar prior to autoclaving. Store at 4° C. for ≦6weeks.

[0138] Denhardt Solution, 100x

[0139] 10 g Ficoll 400

[0140] 10 g polyvinylpyrrolidone

[0141] 10 g bovine serum albumin (Pentex Fraction V, Miles Laboratories,Kankakee, Ill.)

[0142] H₂O to 500 ml

[0143] Filter sterilize and store at −20° C. in 25-ml aliquots

EXAMPLE 1 Construction of cDNA Pools for Use in Direct Selection

[0144] Complementary DNA (cDNA) was prepared using standard methods fromtissues and cell lines that expressed or were likely to expresssufficient amounts of messenger RNA (mRNA) encoding proteins ofinterest. cDNA samples from several sources were sometimes grouped into“cDNA pools”. For example, ionomycin-stimulated T cells, T cell clones,and T-lineage lymphomas were found be the best mRNA source forconstruction of a polymerase chain reaction (PCR)-amplifiable cDNA poolfor direct selection due to high levels of corresponding cytokinesexpressed (first eight samples in Table 3, below). Similarly, a hybridcDNA pool, termed pool #1, was constructed using mRNA isolated from amixture of several activated T-cell clones and lymphomas (obtained fromDavid Lewis, University of Washington, Seattle; Lewis, et al., 1988).

[0145] A complex primary cDNA pool, termed pool #2, was constructed fromhuman fetal and adult tissues, including fetal brain and liver, adultbone marrow, and activated lymphocytes, as well as the followingcytokine-producing cell lines, which, unless otherwise indicated, wereobtained form the American Type Culture Collection (ATCC), RockvilleMd.: A-10 cells (T cell clone), Jurkat cells (ATCC TIB-152), CEM cells(ATCC CCL-119) , HUT-78 cells (ATCC TIB-161), JM cells (ATCC CRL-8294),Molt-4 cells (ATCC accession number CRL1582) and NG-1 cells.

[0146] Prior to isolating mRNA from “activated” T-cell samples, thecells were grown at 5×10⁶ cells/ml in RPMI medium (GIBCO/BRL LifeTechnologies) supplemented with 5% human AB serum as previouslydescribed (Georgopoulos, et al., 1990) and activated using 50 ng/mlphorbol myristate acetate (PMA; Sigma, St. Louis, Mo.) in combinationwith either 25 μg/ml concanavalin A (Con A) (Pharmacia, Piscataway,N.J.) or 0.5 μM ionomycin (Calbiochem-Behring, San Diego, Calif.).

A. Cell Isolation and Synthesis of cDNA

[0147] 1. Isolation of Primary T Cells and Thymocytes.

[0148] Circulating adult T cells and thymocytes were isolated aspreviously described (Georgopoulos, et al., 1990) by Ficoll-Hypaquedensity gradient centrifugation and treated with CD4 Lymphokwik (OneLambda, Los Angeles, Calif.), a mixture of complement and monoclonalantibodies (mAb) directed against non-T-lineage markers and the CD8surface antigen, following the manufacturer's instructions. The finalpurity of each T-lineage cell population was consistently >95% based onflow cytometric analysis after staining with appropriate mabs.

[0149] 2. Cell Activation.

[0150] Cells were activated at 5×10⁶/ml in RPMI medium supplemented with5% human AB serum as previously described (Georgopoulos, et al., 1990)using 50 ng/ml phorbol myristate acetate (PMA; Sigma, St. Louis, Mo.) incombination with either 25 μg/ml concanavalin A (ConA) (Pharmacia,Piscataway, N.J.), 0.5 μM ionomycin (Calbiochem-Behring, San Diego,Calif.), or 2.5 μg/ml PHA (Sigma, St. Louis, Mo.).

[0151] 3. RNA Isolation.

[0152] Cell or tissue homogenates were prepared using a Polytronhomogenizer as described (Chomczynski and Sacchi, 1987). Total RNA wasisolated by the guanidinium isothiocyanate/CsCl method (Glizin, et al.,1974) or by the acid guanidinium isothiocyanate-phenol-chloroformextraction method (Chomczynski and Sacchi, 1987) using a commercial kit(“TRIZOL”, Life Technologies, Inc., Gaithersburg, Md.). mRNA wasisolated from the total RNA using oligo(dT)₂₅ “DYNABEADS” (Dynal Inc.,Lake Success, N.Y.) following manufacturer's instructions (“mRNAIsolation Using “DYNABEADS” OLIGO(dT)₂₅”, pp 35-60 in BIOMAGNETICTECHNIQUES IN MOLECULAR BIOLOGY—TECHNICAL HANDBOOK, Second Edition,Dynal, A. S. (Oslo, Norway) (1995) . Briefly, Poly-A⁺ mRNA was selectedusing “MAGNETIC DYNABEADS OLIGO (dT)₂₅” (Dyndal A. S., Oslo, Norway)according to protocol 2.3.1 (Jakobsen, et al., 1990, 1994) asrecommended by the manufacturer.

[0153] 4. cDNA Synthesis.

[0154] Cell or tissue Double-stranded (ds) cDNA was synthesized usingthe “SUPERSCRIPT” “CHOICE SYSTEM” kit for cDNA synthesis (GIBCO/BRL LifeTechnologies, Gaithersburg, Md.) according to the manufacturer'sinstructions, except that custom adapters (Adapter #3 and adapter #5,described below) were used in place of the EcoR1 adapters supplied withthe kit. Approximately 5 μg of poly(A⁺) mRNA were used with oligo dT15or random hexamers to synthesize ds cDNA. The cDNA was purified from theprimers and the low molecular weight products (<250 bp) on “WIZARD” PCRPreps DNA Purification System columns (Promega, Madison, Wis.) accordingto the manufacturer's protocol, and ligated to dephosphorylated adapters#3 (SEQ ID NO:1, SEQ ID NO:2) or #5 (SEQ ID NO:3, SEQ ID NO:4) usingstandard methods (Sambrook, et al., 1989). Typically, cDNA poolsdesigned for direct selection contained adapter #3 at their ends toallow single primer PCR amplification (e.g., using primer #A3-2 (SEQ IDNO:5) or primer #AD3-CUA (SEQ ID NO:6; see below).

[0155] The adapters were made by combining oligonucleotides #4665 (SEQID NO:1) and #4666 (SEQ ID NO:2) (Adapter #3), or oligonucleotides #A5-1(SEQ ID NO:3) and #A5-2 (SEQ ID NO:4) (Adapter #5), heating the mixturesto 95° C. for 5 minutes, and allowing the mixtures to gradually cool toroom temperature over about 30 minutes. This caused the oligonucleotidesin the mixtures to hybridize and form double stranded adapters with 3′overhangs as illustrated below. The adapters were then dephosphorylatedwith calf intestine phosphatase (CIP) using a standard protocol(Ausubel, et al., 1988), and the phosphatase inactivated by incubatingat 70° C. for 10 min.

[0156] 5′-biotinylated primer #A5-2b (SEQ ID NO:7) was designed tosynthesize biotinylated subtraction probes (e.g., ribosomal,mitochondrial, Alu-, etc.) from cDNA fragments containing Adapter #5using PCR. Primer #A3-2 (SEQ ID NO:5) was designed to synthesize similarprobes from cDNA fragments containing Adapter #3. CUA-containing primer#AD3-CUA (SEQ ID NO:6) was designed to PCR amplify cDNAs that subclonedinto the pAMP10 vector (GIBCO/BRL Life Technologies, Inc).

B. Screening of cDNA Samples with Cytokine PCR Primers

[0157] The presence of specific cytokine cDNAs in the different cDNAsamples/pools was determined using PCR to provide an estimate of thedegree to which such cytokine transcripts were present, i.e., to“validate” the cDNA samples/pools as sources for cytokine cDNAs. The PCRreactions were carried out using standard methods (Mullis, 1987; Mullis,et al., 1987) with the primer pairs presented in Table 2, below. TABLE 2SEQ ID Product Primers NO: T_(ann) Sequence Size J GM-CSF-2 8 60°CCTTGACCATGATCGCCAGCC  187 bp GM-CSF-1 9 C. CCCGGCTTGGCCAGCCTCATC IL3-110 55° CTCTGTGGTGAGAAGGCCCA  287 bp IL3-2 11 C. CTTCGAAGGCCAAACCTGGAIL4-3 12 55° GGTTTCCTTCTCAGTTGTGTT  210 bp IL4-4 13 C.CTCACCTCCCAACTGCTTCCC IL5-1 14 55° CACCAACTGTGCACTGAAGAAATC  213 bpIL4-2 15 C. CCACTCGGTGTTCATTACACC IL9-1 16 60° AGCTTCTGCCCATGGTCCTTAC 360 bp IL9-6 17 C. TCAGCGCGTTGCCTGCCGTGGT IL13-1 18 55°ATGGCGCTTTTGTTGACCAC 1013 bp IL13-5 19 C. CCTGCCTCGGATCAGGCTCC IRF1-7 2055° GAAGGCCAACTTTCGCTGTG  367 bp IRF1-8 21 C. CACTGCGATGTCCCAGTCGGTCF7-3 22 55° CCGTTCCTTCCGATGACAGTGCT  898 bp TCF7-4 23 C.GACATCAOCCACAAGCAAGT EGRI1-6 24 60° CCACCTCCTCTCTCTCTTCCTA  750 bpEGRI1-7 25 C. TCCATGGCACAGATGCTGTAC CD14-5 26 65°CCGCTCGTGCACGTCTCTGCGACC 1022 bp CD14-6 27 C. CACGCCGGAGTTCATTGAGCCCDC25-3 28 65° GAGAGGAAGGAAGCTCTGGCTC  282 bp CDC25-5 29 C.GTCCTGAAGAATCCAGGTGACC

[0158] All cDNA samples and cDNA pools #1 and #2 were screened using PCRwith the above primers, and the relative amount of specificamplification product determined. Prior to amplification, the sampleswere diluted such that the concentration of cDNA was the same in eachsample (about 200 μg/ml). The results for individual cDNA samples arepresented in the Table 3, below. Three pluses (+++) indicate arelatively high level of expression, (++) an intermediate level, (+) arelatively low level, (±) a very low but consistent level, (∓) a verylow and inconsistent level, and (−) no detectable expression. TABLE 3cDNA IL13 IL4 IRF1 IL5 IL3 GM-CSF TCF7 IL9 EGR1 CD14 CDC25 T-cells +++++ +++ +++ +++ ++++ +++ +++ − ± − A-10 +++ ++++ +++ ++++ +++ ++++ ± +++++ ± ∓ Jurkat + + +++ − +++ +++ +++ − − ± + CEM ++ ++ +++ ∓ +++ +++ +++31 ++ ± ++ HUT78 ++ ++ +++ + +++ ++++ +++ 31 +++ ± ++ JM + − +++ ∓ +++++++ +++ − − ± +++ Molt4 + ++ +++ − − − +++ − − ± ++ HNG-1 ++ − ++= − −++++ +++ − +++ ± ++ Daudi − − + − − − ± − − ∓ + 816 − +++ +++ ± +++ − −∓ Mono − − +++ − − +++ − − − +++ − # T-cell origin, producing high levelof IL4 and IL5 and stimulated with Con A for 6 hrs (A-10)

[0159] The data in Table 3, above, suggest that a cDNA pool formed ofcDNA samples in the first 6 rows of the table, along with the monocytecDNA, may be particularly effective as a source of cytokine cDNAs.Accordingly, cDNA pool #3 was formed by combining equal fractions ofthese seven cDNA samples. cDNA pool #4 was formed by combining equalfractions of all eleven cDNA samples listed in Table 3, above, alongwith cDNA from adult bone marrow.

[0160] Eight additional cDNA pools, termed cDNA pools #5-12, wereconstructed by combining, at a 1:1 vol/vol ratio, cDNA pool #3 with cDNAsamples of similar concentrations isolated from human tissues, includingtotal embryo (6, 8, 12 weeks of gestation; pools #5, 6 and 7,respectively), fetal liver (pool #8), fetal brain (pool #9), adult bonemarrow (pool #10), adult thymus (pool #11), and adult spleen (pool #12).

EXAMPLE 2 Preparation of Genomic DNA for Direct Selection A. Mapping ofGenomic Clones used for Direct Selection

[0161] Yeast artificial chromosome (YAC) clones containing sequencesfrom the cytokine gene cluster area of chromosome 5 (5q23-31) wereisolated and physically mapped to provide a template for the directselection of the cDNA samples and pools described in Example 1. YACclone A94G6 (˜425 kb) was obtained from the YAC Washington Universitylibrary (St. Louis, Mo.) (Burke, et al., 1987; Morgan, et al., 1992).Clones 259E7 (˜490 kb) and 854G6 (˜1.3 mb) were isolated from CEPHregular and mega YAC libraries (Bellanne-Chantelot, et al., 1992).

[0162] To construct a physical map of the YAC clones, the clones weredigested with NotI and run on a clamped homogeneous electrical fields(CHEF) mapper system (“CHEF-DR III” Variable Angle Pulsed FieldElectrophoresis System, Bio-Rad Laboratories, Hercules, Calif.). Theyeast clones were grown in liquid AHC medium (Bellanne-Chantelot, etal., 1992) for 48 hrs at 30° C. Cells were harvested, washed andembedded in 0.5% low melting temperature agarose (LMT) as described(Chumakov, et al., 1992). After the zymolase treatment and lysis, YACswere separated in 1% LMT agarose pulsed field gels in 0.5×TBE at 14° C.as described below.

[0163] All the separations were carried out in “CHEF-DR III”pulsed-field electrophoresis system (Bio-Rad) with followingparameters: 1) small YACs (400-500 kb)—power 06 V/cm; run time 24 hrs 4min; initial switch time 21.41; final switch time 39.48; 2) mega YACs(1-1.5 mb)—power 0.6V/cm; run time¹ 22 hrs 30 min; switch time¹ 60.00;run time² 12 hrs 30 min; switch time² 90.00.

[0164] The CHEF gels were blotted and hybridized by standard Southernhybridization (Sambrook, et al., 1989) to probes for IL13, IL4, IL5,IRF1, IL3, GM-CSF, all of which are located in 5q23-31. Thehybridization conditions, unless specified, were: 5×SSPE, 0.1% SDS,5×Denhardt's, ³²P-labelled probe, 65° C. overnight. The blots werewashed first with 1×SSC+0.1% SDS at room temperature, and then with0.1×SSC+0.1% SDS at 65° C. several times, 15 min each.

[0165] The results from the hybridizations were used to construct aphysical map of the 1.3 megabase (Mb) region encompassed by YACs A94G6,259E7 and 854G6, which is presented in FIG. 1. This map was confirmedand further refined by physically mapping a panel of chromosome 5-specific cosmids, as described in Example 5, below.

B. Direct Selection Protocol

[0166] DNA from the genomic clones was isolated as described in part C,below. The isolated DNA was labeled with biotin either by PCR usingbiotinylated primers SEQ ID NO:34 and SEQ ID NO:35, or by conventionallabelling technique. For PCR labelling 5′-biotinylated primers were usedthat had been synthesized at Genosys Biotechnologies, Inc. (Woodlands,Tex.). For conventional labelling either photo-activatable biotin (PAB)or Biotin-21-dUTP nick translation labelling kits from Clontech (PaloAlto, Calif.) were used.

[0167] Biotinylated genomic DNA was hybridized in solution with complexrepresentative cDNA pools #4-12. In selection with YACs A94G6 and 259E7,cDNA pool #4 was used. In selection with the mega YAC 854G6, a mixtureof equal amounts of cDNA pools #4-12 was used. Hybridization was done at65° C. in 20 μl of 5×SSPE, 1×Denhardt, 0.1% SDS to Cot=500. cDNAs thatwas close to saturation was efficiently-captured under these conditions.Specifically-bound cDNAs were captured with Dynal streptavidin beads andwashed with 400 μl of 2×SSC, 0.5% SDS twice at RT, 10 min each and 4times with 400 μl of 0.2×SSC+0.1% SDS at 65° C., 5 min each time.

[0168] Biotinylated genomic DNA-cDNA hybrids and free YAC DNA fragmentswere captured with streptavidin coated magnetic beads (Dynal A. S.,Oslo, Norway) for 30 min at RT with occasional tapping. Two hundred μgof the beads (40 μl, 5 μg/μl) were added per each 5 pMoles ofbiotinylated PCR product (up to 4 kb in length). About 4 μg of thebiotinylated PCR products within the range of 1-4 kb could be capturedby this amount of beads. Dynabeads were washed twice with buffercontaining 1M NaCl in preblocking buffer (TE pH 7.5+200 μg/ml Herringsperm DNA+0.1% BSA) and resuspended to 5 μg/μl in the same bufferwithout DNA or RNA. The suspension was incubated at room temperature(RT) for 30 min, and the beads were captured and isolated with the aidof a magnet. The isolated beads were then washed with 400 μl of 2×SSC,0.5% SDS twice at RT, 10 min each and 4 times with 400 μl of0.2×SSC+0.1% SDS at 65° C., 5 min each time.

[0169] After washing, specifically bound cDNAs were eluted from thehybrids of the biotinylated DNA-cDNA either by incubating the beads with40 μl of 2.5 mM EDTA at 80° C. or with 100 mM NaOH at RT. The latter wasfollowed by neutralization with 20 μl of 0.2 M HCl and 10 μl of 1MTris-HCl pH 8.0.

[0170] Eluted cDNAs were PCR amplified by single primer amplification(SISPA) using either primer #AD3-2 (SEQ ID NO:5) or #AD3-CUA (SEQ IDNO:6). Primer #AD3-CUA was used when PCR products were to be cloned inpAMP10. This cloning system substantially reduced the background of“0”-insert and chimeric clones.

[0171] A second round of direct selection was usually performedfollowing completion of the first round. The first round typicallyresulted in a several hundred to a thousand fold enrichment. The secondround of selection enabled enrichment up to about ahundred-thousand-fold (Morgan, et al., 1992).

[0172] To determine whether a second round of selection was necessary,cDNA aliquots were SISPA-propagated the #AD3-2 primer (SEQ ID NO:5),cleaned up by “WIZARD” PCR column chromatography, quantitated, and run aon 1% agarose gel (about 1 μg/lane) both before and after selection. Thegels were visualized, blotted, and hybridized with the probes known toreside within given genomic DNA. Alternatively, PCR was used to assessthe enrichment by direct selection (Morgan, et al., 1992). If the degreeof enrichment was less than about ten thousand-fold, a second round wasperformed.

C. Preparation of YAC DNA for Direct Selection

[0173] YAC clones A94G6, 259E7 and 854G6 were grown overnight in AHCmedium at 30° C. Agarose blocks were prepared according to the protocolof LePaslier (Chumakov, et al., 1992). Briefly, yeast cells harboringthe YACs were harvested, washed, counted and embedded in 0.5% Sea-PlaqueGTG agarose (FMC, Rockland, Main) as described in CHEF-DR^(R)IIIinstruction manual and application guide. YAC DNAs or their restrictionfragments were separated in 1% LMT agarose (FMC) pulsed field gels in0.5×TBE at 14° C. according to the Bio-Rad protocols. For smaller YACs(400-500 kb), the following parameters were applied: power 0.6 V/cm; runtime 24 hrs 4 min; initial switch time 21.41; final switch time 39.48.For mega YACs (1-1.5 mb), the following parameters were applied: power0.6V/cm; run time¹ 22 hrs 30 min; switch time¹ 60.00; run time² 12 hrs30 min; switch time² 90.00.

[0174] YAC DNA-containing bands (containing ˜250 ng DNA) were excised,placed into tubes with 2 vol of 1×Sau3AI buffer (New England Biolabs(NEB), Beverly, Mass.), and treated with 12 U of Sau3AI (NEB) at 37° C.for 5 hrs.

[0175] The agarose containing the digested YAC DNA was then melted in 1volume of TE at 68° C., and the DNA isolated using the “WIZARD” PCRPreps DNA Purification System (Promega, Madison, Wis.) at 37° C.following the manufacturer's instructions. DNA was eluted with TE (pH8.0).

[0176] Due to steric hindrance of the incorporated biotin, one of thefollowing adapters was ligated to the eluted YAC DNA to allow moreefficient SISPA amplification and PCR controlled labelling with biotin:(i) Sau3A1 semiadapter #1, made of primers having sequences SEQ ID NO:52and SEQ ID NO:53, (ii) Sau3A1 semiadapter #2, made of primers havingsequences SEQ ID NO:30 and SEQ ID NO:31, or (iii) Sau3A1 adapter#S-1/S-2, made of primers having sequences SEQ ID NO:32 and SEQ IDNO:33. Sau3A1 semiadapter #2 provided better yields and specificity inligations and subsequent PCRs.

[0177] Ligation of the linkers was typically carried overnight at +14°C. in 20 μl of the reaction mix, containing 100 ng of Sau3AI-digestedYAC DNA, 100 pmoles of adapter, and 6 U of T4 DNA Ligase (New EnglandBiolabs).

EXAMPLE 3 Direct Selection with the Genomic DNA Fragments EncompassingCytokine Gene Cluster in 5q23-31

[0178] YAC clone DNA was PCR-amplified for 30 cycles using biotinylatedprimers SEQ ID NO:34 and SEQ ID NO:35. The amplified YAC DNA was thenpreblocked with Cot1 DNA (GIBCO/BRL Life Technologies, Inc.) and usedfor direct selection with cDNA samples as follows.

[0179] One hundred ng of the amplified biotinylated YAC DNA were mixedwith 5 μg Cot1 DNA and 5 μg yeast host strain AB1380 in 8 μl of waterand denatured for 15 min under mineral oil at 98° C. in a heating block.The mixture was then supplemented with 2 μl of 25×SSPE+5×Denhardt+0.5%SDS to a final concentration of 5×SSPE, 1×Denhardt solution and 0.1sodium dodecyl sulfate (SDS) in 10 μl, and hybridized for 2.0 hrs at 60°C. to Cot=20. In parallel, 10 μg of cDNAs were denatured in 8 μl ofwater for 15 min under mineral oil and treated as described above.

[0180] Ten μg of cDNAs from selected samples were denatured in 8 μl ofwater for 15 min under mineral oil as described above and supplementedto a final concentration of 5×SSPE, 1×Denhardt and 0.1% SDS. Directselection was initiated by mixing 10 μl of the amplified cDNAs with 10μl of the amplified and preblocked biotinylated YAC DNA (100 ng), andhybridization was conducted to a Cot=500 (about 40 hrs) at 65° C. undermineral oil. A Cot value of 1 is equivalent to 83 μg/ml of DNA×1 hour at60° C. in 5×SSPE.

A. Isolation of cDNA/DNA Hybrids with Magnetic Beads

[0181] The hybridization mixture was then incubated with streptavidincoated magnetic beads (Dynal, Inc., Lake Success, N.Y.) in a buffercontaining 1M NaCl in TE pH 7.5+0.1% BSA for 30 min at room temperaturewith occasional tapping to immobilize the biotinylated genomic DNAfragments, some of which contained hybridized cDNA species. Two hundredμg of the beads (40 μl, 5 μg/μl), effective to capture about 4 μg of thebiotinylated PCR products (1-4 kb), were added per each 5 pMoles ofbiotinylated YAC DNA PCR product.

[0182] Following the incubation, the “DYNABEADS” were collected using amagnetic stand (Dynal, Inc.). The beads were then washed with 400 μl of2×SSC, 0.5% SDS twice at RT, 10 min each, and 4 times with 400 μl of0.2×SSC+0.1% SDS at 65° C., 5 min each. Specifically bound biotinylatedDNA-cDNAs were incubated either with 40 μl of 2.5 mM EDTA at 80° C., orwith 100 mM NaOH at RT with occasional tapping of the tube, eluted andneutralized with 20 μl of 0.2 M HCl and 10 μl of 1M Tris-HCl pH 8.O.

[0183] Specifically bound biotinylated DNA-cDNAs were eluted either with40 μl of 2.5 mM EDTA at 80° C. or with 100 mM NaOH at RT with occasionaltapping of the tube. In cases where NaOH was used, the eluted beads wereneutralized with 20 μl of 0.2 M HCl and 10 μl of 1M Tris-HCl pH 8.O.

B. Subcloning of Selected cDNAs

[0184] The eluted material (˜2 μl) was PCR-amplified for approximately30 cycles in 100 μl tubes using approximately 50 pmoles each of primersSEQ ID NO:5 and SEQ ID NO:6, typically for 30 cycles using 2 μl of theeluate per 100 μl reaction. Primer SEQ ID NO:6 was used only when thePCR products were to be subcloned into the pAM10 vector. The PCR cycleparameters were as follows: 30 sec at 94° C., 30 sec at T_(ann)−5° C.,and 2 min at 72° C. After the last cycle, the reactions were incubatedfor 7 min at 72° C., and then kept at 4° C. until further processing.

[0185] The PCR-amplified material was typically used for a second roundof direct selection as described above, selected products were PCRamplified with primer SEQ ID NO:6, and -1-5 μg of the selected cDNAswere subcloned into the pAMP10 vector (“CLONEAMP” directional PCRcloning system, GIBCO/BRL Life Technologies, Inc), which is adapted foruracil DNA glycosilase (UDG) cloning. This approach does not requirerestriction endonuclease digestion, end-polishing, purification orligation. With this system, PCR products should contain specified12-base 5′ sequence that contains dUMP residues instead of dTMP.

[0186] Treatment with UDG renders dUMP residues abasic, disruptingbase-pairing which results in 3′-protruding termini. pAMP10 plasmidcontains a modified multiple cloning site and 3′ ends that arecomplementary to the 3′ protruding termini of the UDG-treated PCRamplification products obtained with the primer SEQ ID NO:6. Linearvector and UDG both go to the selected amplified cDNAs, without ligase,and are complete in less than 30 min, producing recombinant moleculesready for transformation.

[0187] 1 μl of 20 μl UDG-reaction mixture was typically used toelectroporate 50 μl of electrocompetent JS5 E. coli cells (Bio-Rad)according to manufacture's protocol in a “GENE PULSER” apparatus(Bio-Rad), in 0.1 cm electrode gap cuvettes. After 1 hr incubation ofelectroporated cells in 1 ml of Luria Broth (LB), 100 μl of the culturewas plated onto LB plates containing 100 μg/ml Ampicillin.

[0188] The quality of a direct selection was monitored by Southern blothybridization using a probe known to reside on the YAC, when similarquantities of the PCR amplified cDNA were loaded on the gel before andafter the selection. Usually up to 100,000—fold enrichment was observedin two rounds of selection. Before and after the selection cDNA aliquotswere SISPA-propagated with the primer SEQ ID NO:5, cleaned up by WizardPCR column chromatography, quantitated, and run in 1 agarose gel (about1 μg/lane). The gels were visualized, blotted, and hybridized with theprobes known to reside within given genomic DNA. Alternatively,quantitative PCR was used to assess the enrichment by direct selection.The enrichment ratios of direct selection were also monitored by platingcDNA aliquots before and after the selection, and counting the ratio ofseveral marker clones to overall colonies. For example, if there was oneIL3 positive clone in 10⁶ colonies before and one in 10 after theselection, the enrichment was considered to be around 10⁵ fold on thisstep. The selection process was controlled such that there was at leasta 10 thousand-fold enrichment for at least one marker. Alternatively,negative selection was controlled for the markers known not to be on theYACS. In this case, the data were examined for a decrease in the ratioof this gene during selection.

EXAMPLE 4 Hybridization and Sequence Analysis of the ArrayedRegion-Specific cDNAs A. Analysis and Subcloning of the PCR Products

[0189] Individual colonies of PCR pAMP10 clones generated as describedabove were used to inoculate wells containing LB broth in 96-wellplates. The cultures were incubated overnight at 37° C. and an aliquotfrom each well was transferred to an Immobilon-N membrane (Millipore,Bedford, Mass.), forming a grid corresponding to the locations of thesamples in the plate.

[0190] The DNA was immobilized on the membranes using UV-crosslinking,and the membranes were then screened with ³²P-labelled YAC, Cot1,mitochondrial, ribosomal and single copy probes known to reside on astarting genomic clone, in order to eliminate nonspecific or alreadyknown cDNAs from further analysis, as follows.

[0191] Membranes with the arrayed cDNAs were hybridized with different³²P-labelled probes: highly repetitive, high molecular weight human COT1DNA (Life Technologies, Inc.), human mitochondrial and ribosomal probes,starting YAC probe, single copy marker genes, known to reside within thegenomic region in question. Because starting total cellular RNAcontained certain amount of heteronuclear, ribosomal and mitochondrialspecies, final cDNA pools still contained these species, and it was mucheasier to prescreen the arrayed libraries for them rather than tointroduce additional steps into the selection protocol.

[0192] About 55% of the clones in arrayed selected cDNA libraries wereeliminated in such a prescreening procedure. Single copy known genesfrom the genomic region in question were monitored as well, and wereused to evaluate the quality of the selected material and the depth ofthe libraries. These statistics also aided in determining how many novelcDNAs might be expected. For instance, 18% of the clones in the A94G6YAC selection library belonged to IRF1, IL13, IL3 and IL5.

[0193] Negative clones were subject to sequencing. Sequencing dataconfirmed that there were at least 7 novel gene candidates, one of whichwas assembled into a full-length clone of a human homolog of S.cerevisiae RAD50. After computer analysis of the sequencing data, PCRprimers were designed for prospective novel gene cDNAs and were usedboth to evaluate the tissue-specificity of expression of the genecandidates and for physical mapping of cDNAs to human chromosome 5 andthe starting YAC, as described below.

B. Sequence Analysis

[0194] Unique and presumably novel cDNA clones were sequenced andscreened for similarity of their nucleotide and amino acid sequencesusing Fasta, BlastN, BlastX, tBlastN programs in known protein andnucleic acid databases. For efficient and quick identification ofnon-overlapping cDNAs, redundant cDNAs were eliminated by subsequenthybridization of the arrayed libraries with already identifiedindividual cDNAs as probes and unique sequences were further analyzed asdescribed below.

[0195] After two rounds of selection, ˜66% of all clones mapped back tothe starting genomic region, i.e., YAC or any other genomic DNA used toselect these particular cDNAs. Each cDNA species comprised >1% of theselected material. The complexity of the selected cDNAs (i.e., thenumber of distinct species of DNAs) was dependent on the gene density inthe region with respect to which the cDNAs were selected, and on thecomplexity of the starting cDNA sources.

EXAMPLE 5 Mapping Selected Clones to Chromosome 5: Physical Mapping ofcDNAs to Cosmids

[0196] A human chromosome 5-specific cosmid library was obtained from L.Deaven (Los Alamos National laboratories, N.M.) as arrayed individualclones in 96 well-plates that represented 8×genome equivalents subclonedin the sCos1 vector (Longmire, et al., 1993). The E. coli DH5 clonescontained about 81% human inserts, 8% rodent inserts and 3%nonrecombinants. About 25,000 individual cosmid clones were microgriddedonto “HYBOND-N” nylon membrane (Amersham Life Sciences, UK) using a“BIOMEK 1” (Beckman, Palo Alto, Calif.) robotic station. The filterswith spotted clones were grown overnight on 96-well plate lids(Cat.#76-205-05, ICN Flow, Costa Mesa, Calif.) filled with 1.5% LBSeaKem GTG agarose (FMC Bioproducts, Rockland, Main) supplemented with20 μg/ml kanamycin (Sigma).

[0197] After treating the filters on Whatman 3 mm paper saturated with2×SSC/0.5% SDS for 2 min, the filters were microwaved for 2.5 min at˜750 W until dry. Then they were submerged in a buffer containing 50 mMTris-HCl pH 8.0, 50 mM EDTA, 100 mM NaCl, 1% Na-lauryl-sarcosine, and250 mg/ml Proteinase K (Boehringer). After incubation for 20 min at 37°C. the filters were UV-crosslinked on Fotodyne crosslinker for 35 sec.After washing, the microgrids were hybridized with different³²P-oligolabelled YAC, cDNA or terminal cosmid walking probes asdescribed below. Many cDNAs selected with the above specified YACs weremapped to the clones on the microgrids. Other libraries may be similarlyused for mapping purposes, including YAC, BAC, and P1 genomic libraries.

EXAMPLE 6 Determining Tissue Specific Expression

[0198] Tissue specificity of expression was performed using Northernblot analyses and PCR detection.

A. Northern Blot Analyses

[0199] Total RNA was isolated by the guanidinium isothiocyanate/CsClmethod (Glisin, et al., 1974) or by the acid guanidinium isothiocyanate-phenol-chloroform extraction method using a commercial kit (Tri-reagent,Molecular Research Center, Cincinnati, Ohio), and was resolved onformaldehyde gels using standard methods (Sambrook, et al., 1989). Thegels were blotted onto “HYBOND N” membranes (Amersham Life Sciences,UK), fixed by UV-crosslinking, and the membranes probed withradiolabeled probes corresponding to the clones. Conditions:Hybridization buffer, containing 5×SSPE, 2×Denhardt, 100 μg/ml sonicatedsalmon sperm DNA, 0.5% SDS; Hybridization temperature =65° C.

[0200] All probes consisted of DNA labeled by the random hexamer primingmethod using a commercial kit (Pharmacia, Piscataway, N.J.), with theexception of the IL4 probe, for which a single-stranded RNA probe wasemployed.

B. RT-PCR Analysis

[0201] About 1 μg of total RNA from different sources was reversetranscribed (RT) by random priming with “SUPERSCRIPT II” (GIBCO/BRL LifeTechnologies, Gaithersburg, Md.) in 20 μl of reaction mix as specifiedby the manufacturer. After heat inactivation, 1 μl of the RT-reactionwas used in a 30 μl PCR of 30 cycles of conventional PCR with theprimers and T_(ann) specified below. Each PCR reaction contained 20 mMTris-HCl pH 8.9 (at 25° C.), 16.7 mM (NH₄)₂SO₄, 1.5 mM MgCl₂, 200 μMdNTPs, 1 μM primers, and 0.8 U AmpliTaq (Cetus).

[0202] PCR-based detection of tissue-specific expression was performedusing the following PCR-amplifiable primary cDNA pools: Total Embryo (6,8, 12 weeks of gestation), Fetal Liver, Fetal Brain, Fetal Muscle,Placenta, Adult Heart, Adult Muscle, Adult Liver, Adult Brain, AdultPancreas, Adult Kidney, Adult Aorta, Adult Spleen, Adult Testis, AdultBone Marrow, JY B-cell line, Resting T-cells and Activated T-cells.

[0203] These cDNAs were either used directly as targets or PCR amplifiedfor 30 cycles using primer SEQ ID NO:5. Amplified cDNAs were purified ona “WIZARD-PCR” column (Promega, Madison, Wis.), quantitated, and used inPCR reactions with different specifically-designed primers. The primerused were as indicated in Table 1.

[0204] Each PCR reaction contained 50 ng of one cDNA sample or pool(amplified or unamplified) as the target. After 30 cycles of PCR theproducts were separated on agarose gels and the intensity of the signalsrecorded and represented in Table 1, above.

EXAMPLE 7 Identification of Gene Function by Homology and MotifIdentification A. Identification of the Human Homolog of the Yeast GeneRAD50

[0205] Three cDNA clones A106, G157, G170, selected with the YACs A94G6and 854G6 as described in Example 3, were mapped to chromosome5-specific cosmid 256E1 about 10 kb upstream of the IL13 gene. CloneA106, when used as a probe, detected a predominant and ubiquitous mRNAspecies of 1.9 kb on a Northern blot of various mRNA species, includingT-cells, B-cells, testis, small intestine, and brain. The primers A106-1and A106-2 (SEQ ID NO:36 and SEQ ID NO:37, respectively) were used in RTPCR to evaluate the tissue distribution and to extend the cDNA to itsfull length.

[0206] RT PCR analysis confirmed that this message was expressed inactivated adult T-cells, total embryo, fetal muscle, fetal liver,placenta, adult heart, and adult bone marrow. The extension of the A106cDNA clone confirmed that it is a human homolog of the yeast gene RAD50.Northern blot hybridizations with the C-terminal coding portion of thegene used as a probe revealed two mRNA species: a strong signal of about5.8. kb and a weaker signal at 6.5 kb.

[0207] A near full-length cDNA, termed G10 (also referred to as“rad50.seq”; SEQ ID NO:54), was obtained using marathon RACE (rapidamplification of cDNA ends; Chenchik, et al., 1995) techniques withactivated T-cell and testis cDNA marathon pools. A marathon cDNA pool incontrast to a regular cDNA pool has a special adapter at the ends ofcDNAs. Such cDNAs can not be SISPA amplified, because the adapter designsuppresses PCR with a single adapter-specific primer (Siebert, et al.,1995). Exponential PCR will be observed only if a gene-specific primeris employed along with the adapter-specific primer. Such cDNA poolsallow both 5′- and 3′-RACE amplifications, and finally isolation ofintact genes via combination PCR (Chenchik, et al., 1995).

[0208] cDNA clone G10 is about 5,800 bp long and encodes a protein of1312 aa with two highly-conserved domains with respect yeast RAD50: anN-terminal ATP-binding domain and a conserved C-terminal domain. Anon-coding 3′-flanking portion of the gene, when used as a probe,detected mRNA species of 1.9 and 0.85 kb in multiple tissues. This mayindicate either unusual alternative splicing of the RAD50 gene or anoverlap with another gene. RT-PCR and Northern blot analyses haveconfirmed that G10 is expressed in activated T-cells, B-cells, placentaand multiple fetal tissues, including fetal liver.

[0209] Clones G18 and H230 have a 31 bp stretch at their 3′-endhomologous to RAD50. RT-PCR analysis on different cDNA pools and genomicDNA with primers G18-1/2 (SEQ ID NO:50 and SEQ ID NO:51), respectivelysuggested alternative splicing of the RAD50. The RAD50 had been firstmapped by PCR using the primers A106 1/2 (SEQ ID NO:36 and SEQ ID NO:37)and then by YAC Southern blot hybridization. Several chromosome5-specific cosmids had been isolated that span the RAD50 gene. Thegenomic equivalent of G10 was found to be between about 80 and 150 kb inlength. RAD50 appears to be a large gene with at least six exons. TheC-terminal 2b fragment (˜6 kb) of RAD50 was sequenced, enabling thepositioning of four C-terminal exons.

C. Isolation and Mapping of cdc3 Human Homolog

[0210] Seventeen cDNAs encoding a novel cell division control gene wereidentified using direct selection with YAC clone 854G6. These cDNAsrepresent bin 23 in Table 1. The consensus sequence of these cDNAs wasextended using the marathon RACE technique and is presented herein asSEQ ID NO:97.

EXAMPLE 8 Assays to Evaluate Immunomodulatory Activity of Compounds orPolypeptides A. Peripheral Blood Lymphocyte (PBL) Proliferation Assay

[0211] Human peripheral blood lymphocytes are prepared using anestablished method (e.g., Boyum, 1968). Human blood buffy coat samplesare resuspended in a calcium and magnesium -free Hank's balanced saltsolution (HBSS, Gibco/BRL Life Technologies) at ˜24° C. Approximately 25ml of the cell suspension is then layered onto ˜15 ml of Ficoll-Paque(Pharmacia LKB Biotechnology, Inc.), and is centrifuged at ˜400×g for 30minutes at 15° C.

[0212] Following centrifugation, the PBL suspension at the interface istransferred to new centrifuge tubes, resuspended in a total volume of˜45 ml HBSS and centrifuged at ˜350×g for ˜10 minutes at 15° C. Thesupernatants are discarded and the PBL's are resuspended in 10 ml HBSS,combined, and centrifuged at ˜260×g for ˜10 minutes at 15° C. The cellpellets are suspended in 10 ml of X-Vivo tissue culture medium (BioWhittaker, Walkersville, Md.) and counted using a hemocytometer. Tissueculture medium is then added to achieve a final cell concentration of˜1×10⁶ cells/ml.

[0213] Proliferation assays are carried out in 96 well sterile tissueculture plates (e.g., Costar 3790 or Costar 3595). A volume of 100 μlPBL suspension is added to each well and the plates are incubated underan atmosphere of 93% air/7% CO₂ in a tissue culture incubator at 37° C.Compounds or polypeptides whose immunomodulatory activity is to beevaluated are then added to the wells. Different wells may havedifferent compounds or polypeptides, or they may have differentconcentrations of the same compound or polypeptide. The plates may alsohave several wells with the same immunomodulatory compound orpolypeptide at the same concentration, with other types ofimmunomodulatory compounds (e.g., small molecules) present in somewells.

[0214] After a selected period of time (e.g., 48 hours), ˜50 μl ofX-Vivo tissue culture medium containing ˜8 μCi/ml [³H]Thymidine(Amersham, ˜50 Ci/mmol) are added to each tissue culture well. Followingfour hours additional incubation at 37° C., the cells are removed fromthe tissue culture wells and applied to filter paper using, e.g., a cellharvester. The filter paper is dried and cut into small (e.g., 1 cm)discs, which are placed in a scintillation vial containing ˜2 ml ofscintillation fluid (Biosafe, Research Products International Corp.).Samples are then counted in a scintillation counter (e.g., the BeckmanLS 6000SC).

B. Spleen Cell Proliferation Assay

[0215] C3H mice are sacrificed by CO₂ inhalation and the spleens removedand cleaned of any fat or connective tissue. A nick is made in the tipof each spleen, and cells are collected by gentle aspiration through thetissue with Hank's balanced salt solution (HBSS) using a syringe and18-gauge needle. The resultant spleen cell solution is filtered throughNytex sterile nylon mesh (Tetco), centrifuged at 200×g for 10 minutes,resuspended in HBSS, and centrifuged as above.

[0216] The pelleted cells are resuspended in a small amount of HBSS,counted using a hemocytometer and then resuspended in RPMI 1640 medium(Gibco/BRL Life Technologies, catalog # 430-1800GL), containing2-mercaptoethanol (50 μM), glutamine (2 mM), penicillin (100 U/ml),streptomycin (100 mg/ml and 5% (v/v) fetal calf serum (Hyclone or Sigma)to a concentration of 2.5×10⁶ cells/ml.

[0217] A 100 μl volume of spleen cell solution is added to each well ofa 96-well plate. Compounds or polypeptides having immunomodulatoryactivity or medium alone are added in a volume of 50 μl. The culturesare incubated for 2 days (37° C., 5- CO₂), and tritiated thymidineincorporation is assayed as described above.

EXAMPLE 9 PCR-Based Detection of Activated T-Cells

[0218] Polymerase chain reaction amplifications were performed asdescribed above using cDNA derived from the following sources. Unlessotherwise indicated, the tissues samples were obtained from adultindividuals. (1) cDNA pool #3, (2) activated T-cells, (3) bone marrow,(4) fetal liver, (5) testis, (6) thymus, (7) peripheral leukocytes, (8)lymph node, (9) brain, (10) fetal thymus, (11) fetal brain, (12) spleen,(13) placenta, (14) muscle, (15) kidney and (16) heart.

[0219] Each 100 μl PCR reaction contained 50 ng of cDNA target, 50 pmolseach of primers A116-1 (SEQ ID NO:150) and A116-2 (SEQ ID NO:149), 200μM dNTPs, 2 mM MgCl2, 1x magnesium-free amplification buffer(Perkin-Elmer) and 2.5 U Taq DNA Polymerase. The primers were designedbased on the sequence shown in FIG. 1 (SEQ ID NO:151), which is aportion of the A116 sequence (SEQ ID NO:85). The locations of theprimers relative to the sequence are underlined.

[0220] The samples were cycled using a Perkin Elmer DNA Thermal Cycler480 (Norwalk, Conn.) thermal cycler for 30 times through the followingsteps: 30 s at 94° C., 30 s at 55° C. and 2 minutes at 72° C. Theamplification products were then separated on agarose gels, stained withethidium bromide, and visualized to determine their size. An exemplaryimage of such a gel is shown in FIG. 2. The lanes in the gel correspondto cDNA from tissues (1) through (12), above. Amplification products ofthe appropriate size were consistently detected only in samplescontaining activated T-cells (1 and 2) . Such amplification productswere not detected in any of the other samples (3-16), with the exceptionof fetal liver (4), where a much fainter signal was occasionallyobserved.

[0221] These results indicate that PCR-based amplification of a DNAfragment having the sequence SEQ ID NO:151 may be used as a sensitivediagnostic for the presence of activated T-cells in a sample of cells.An exemplary primer pair suitable for use with such an amplificationreaction consists of primers having sequences SEQ ID NO:149 and SEQ IDNO:150.

[0222] While the invention has been described with reference to specificmethods and embodiments, it is appreciated that various modificationsand changes may be made without departing from the invention.

1 151 22 base pairs nucleic acid single linear DNA NO NO oligonucleotide#4665 for adapter #3 1 GAGGATCCAG AATTCTCGAG TT 22 20 base pairs nucleicacid single linear DNA NO NO oligonucleotide #4666 for adapter #3 2CTCCTAGGTC TTAAGAGCTC 20 23 base pairs nucleic acid single linear DNA NONO oligonucleotide #A5-1 for adapter #5 3 TGGATCCTCT AGAGAGTGTG GTT 2321 base pairs nucleic acid single linear DNA NO NO oligonucleotide #A5-2for adapter #5 4 ACCTAGGAGA TCTCTCACAC C 21 21 base pairs nucleic acidsingle linear DNA NO NO oligo #AD3-2 for PCR amp of cDNAs 5 ACTCGAGAATTCTGGATCCT C 21 33 base pairs nucleic acid single linear DNA NO NO oligo#AD3-CUA for PCR amp of cDNAs 6 CUACUACUAC UAACTCGAGA ATTCTGGATC CTC 3321 base pairs nucleic acid single linear DNA NO NO oligo #A5-2b for PCRamp of cDNAs 7 CCACACTCTC TAGAGGATCC A 21 21 base pairs nucleic acidsingle linear DNA NO NO Primer GM-CSF-2 8 CCTTGACCAT GATGGCCAGC C 21 21base pairs nucleic acid single linear DNA NO NO Primer GM-CSF-1 9CCCGGCTTGG CCAGCCTCAT C 21 20 base pairs nucleic acid single linear DNANO NO Primer IL3-1 10 CTCTGTGGTG AGAAGGCCCA 20 20 base pairs nucleicacid single linear DNA NO NO Primer IL3-2 11 CTTCGAAGGC CAAACCTGGA 20 23base pairs nucleic acid single linear DNA NO NO Primer IL4-3 12GGTTTCCTTC TCAGTTGTGT TCT 23 21 base pairs nucleic acid single linearDNA NO NO Primer IL4-4 13 CTCACCTCCC AACTGCTTCC C 21 24 base pairsnucleic acid single linear DNA NO NO Primer IL5-1 14 CACCAACTGTGCACTGAAGA AATC 24 21 base pairs nucleic acid single linear DNA NO NOPrimer IL4-2 15 CCACTCGGTG TTCATTACAC C 21 22 base pairs nucleic acidsingle linear DNA NO NO Primer IL9-1 16 AGCTTCTGGC CATGGTCCTT AC 22 22base pairs nucleic acid single linear DNA NO NO Primer IL9-6 17TCAGCGCGTT GCCTGCCGTG GT 22 20 base pairs nucleic acid single linear DNANO NO Primer IL13-1 18 ATGGCGCTTT TGTTGACCAC 20 20 base pairs nucleicacid single linear DNA NO NO Primer IL13-5 19 CCTGCCTCGG ATGAGGCTCC 2020 base pairs nucleic acid single linear DNA NO NO Primer IRF1-7 20GAAGGCCAAC TTTCGCTGTG 20 20 base pairs nucleic acid single linear DNA NONO Primer IRF1-8 21 CACTGGGATG TGCCAGTCGG 20 23 base pairs nucleic acidsingle linear DNA NO NO Primer TCF7-3 22 CCGTTCCTTC CGATGACAGT GCT 23 20base pairs nucleic acid single linear DNA NO NO Primer TCF7-4 23GACATCAGCC AGAAGCAAGT 20 22 base pairs nucleic acid single linear DNA NONO Primer EGRI1-6 24 CCACCTCCTC TCTCTCTTCC TA 22 21 base pairs nucleicacid single linear DNA NO NO Primer EGRI1-7 25 TCCATGGCAC AGATGCTGTA C21 24 base pairs nucleic acid single linear DNA NO NO Primer CD14-5 26CCGCTGGTGC ACGTCTCTGC GACC 24 21 base pairs nucleic acid single linearDNA NO NO Primer CD14-6 27 CACGCCGGAG TTCATTGAGC C 21 22 base pairsnucleic acid single linear DNA NO NO Primer CDC25-3 28 GAGAGGAAGGAAGCTCTGGC TC 22 22 base pairs nucleic acid single linear DNA NO NOPrimer CDC25-5 29 GTCCTGAAGA ATCCAGGTGA CC 22 25 base pairs nucleic acidsingle linear DNA NO NO Primer Sau3AI-2 for semiadapter #2 30 TCGCGGCCGAATTCTAGAGC TCGCT 25 21 base pairs nucleic acid single linear DNA NO NOPrimer Sau3AI-3 31 CGCCGGCTTA AGATCTCGAG C 21 28 base pairs nucleic acidsingle linear DNA NO NO Primer Sau3AI S-1 32 GATCTCGAGG ATCCTCAGAGAGTAGTAG 28 24 base pairs nucleic acid single linear DNA NO NO PrimerSau3AI S-2 for adapter #S-1/2 33 AGCTCCTAGG AGTCTCTCAT CATC 24 23 basepairs nucleic acid single linear DNA NO NO 5′Biotin-YAC primer #1 PCRamp of YACs 34 AGCGAGCTCT AGAATTCGGC CGC 23 26 base pairs nucleic acidsingle linear DNA NO NO 5′Biotin-YAC primer #2 PCR amp of YACs 35CTACTACTCT CTGAGGATCC TCGAGA 26 20 base pairs nucleic acid single linearDNA NO NO Primer A106-1 for RAD50 36 GTCATCCAGA CTCAGAGCTC 20 20 basepairs nucleic acid single linear DNA NO NO Primer A106-2 for RAD50 37CTGTCTAGGC AAACATGCTC 20 38 base pairs nucleic acid single linear DNA NONO Primer G10-C for RAD50 38 GAGAGGAATT CTTTTAATGA ACATTGAATC CCAGGGAG38 39 base pairs nucleic acid single linear DNA NO NO Primer G10-N forRAD50 39 GAGAGGATCC TTTGTGGACT CCAGGTCCCT GGTGAGATT 39 21 base pairsnucleic acid single linear DNA NO NO Primer G34-2 40 CCACACTGATGAACACACTC T 21 21 base pairs nucleic acid single linear DNA NO NOPrimer G34-3 41 AGCTCGCTCT TGGAGATGGT G 21 21 base pairs nucleic acidsingle linear DNA NO NO Primer G34-4 42 TGGCTTCCTC AGTCTCGAAG G 21 21base pairs nucleic acid single linear DNA NO NO Primer G34-5 43CACCATCTCC AAGAGCGAGC T 21 24 base pairs nucleic acid single linear DNANO NO Primer G34-6 44 CACCATGAGG CATGCGTGCG CCTG 24 24 base pairsnucleic acid single linear DNA NO NO Primer G34-7 45 CAGGCGCACGCATGCCTCAT GGTG 24 24 base pairs nucleic acid single linear DNA NO NOPrimer G34-8 46 GTAGATCTGG ACCCCGTTGC TGAC 24 24 base pairs nucleic acidsingle linear DNA NO NO Primer G34-9 47 GTCAGCAACG GGGTCCAGAT CTAC 24 27base pairs nucleic acid single linear DNA NO NO Primer G34-10 48ACCAGTTCCC CACGGATGAT GAGGCTG 27 27 base pairs nucleic acid singlelinear DNA NO NO Primer G34-11 49 CCTCCGCGAG CAGACCCACA GCCGGCA 27 23base pairs nucleic acid single linear DNA NO NO Primer G18-1 50ATCAGACCAG GGACAGACTT GCC 23 22 base pairs nucleic acid single linearDNA NO NO Primer G18-2 51 CATCTTCTTC ATGCCCTAAC TG 22 20 base pairsnucleic acid single linear DNA NO NO oligonucleotide #4578 52 TAGGAGATCTCTTAAGAGCT 20 24 base pairs nucleic acid single linear DNA NO NOoligonucleotide #4579 53 TCTCGAGAAT TCTCTAGAGG ATCC 24 5893 base pairsnucleic acid double linear cDNA to mRNA NO NO Rad50.seq CDS 389..4324 54CCAGGAGAGC GGCGTGGACG CGTGCGGGCC TAGAGGCCCA CGTGATCCGC AGGGCGGCCG 60AGGCAGGAAG CTTGTGAGTG CGCGGTTGCG GGGTCGCATT GTGGCTACGG CTTTGCGTCC 120CCGGCGGGCA GCCCCAGGCT GGTCCCCGCC TCCGCTCTCC CCACCGGCGG GGAAAGCAGC 180TGGTGTGGGA GGAAAGGCTC CATCCCCCGC CCCCTCTCTC CCGCTGTTGG CTGGCAGGAT 240CTTTTGGCAG TCCTGTGGCC TCGCTCCCCG CCCGGATCCT CCTGACCCTG AGATTCGCGG 300GTCTCACGTC CCGTGCACGC CTTGCTTCGG CCTCAGTTAA GCCTTTGTGG ACTCCAGGTC 360CCTGGTGAGA TTAGAAACGT TTGCAAACAT GTCCCGGATC GAAAAGATGA GCATTCTGGG 420CGTGCGGAGT TTTGGAATAG AGGACAAAGA TAAGCAAATT ATCACTTTCT TCAGCCCCCT 480TACAATTTTG GTTGGACCCA ATGGGGCGGG AAAGACGACC ATCATTGAAT GTCTAAAATA 540TATTTGTACT GGAGATTTCC CTCCTGGAAC CAAAGGAAAT ACATTTGTAC ACGATCCCAA 600GGTTGCTCAA GAAACAGATG TGAGAGCCCA GATTCGTCTG CAATTTCGTG ATGTCAATGG 660AGAACTTATA GCTGTGCAAA GATCTATGGT GTGTACTCAG AAAAGCAAAA AGACAGAATT 720TAAAACTCTG GAAGGAGTCA TTACTAGAAC AAAGCATGGT GAAAAGGTCA GTCTGAGCTC 780TAAGTGTGCA GAAATTGACC GAGAAATGAT CAGTTCTCTT GGGGTTTCCA AGGCTGTGCT 840AAATAATGTC ATTTTCTGTC ATCAAGAAGA TTCTAATTGG CCTTTAAGTG AAGGAAAGGC 900TTTGAAGCAA AAGTTTGATG AGATTTTTTC AGCAACAAGA TACATTAAAG CCTTAGAAAC 960ACTTCGGCAG GTACGTCAGA CACAAGGTCA GAAAGTAAAA GAATATCAAA TGGAACTAAA 1020ATATCTGAAG CAATATAAGG AAAAAGCTTG TGAGATTCGT GATCAGATTA CAAGTAAGGA 1080AGCCCAGTTA ACATCTTCAA AGGAAATTGT CAAATCCTAT GAGAATGAAC TTGATCCATT 1140GAAGAATCGT CTAAAAGAAA TTGAACATAA TCTCTCTAAA ATAATGAAAC TTGACAATGA 1200AATTAAAGCC TTGGATAGCC GAAAGAAGCA AATGGAGAAA GATAATAGTG AACTGGAAGA 1260GAAAATGGAA AAGGTTTTTC AAGGGACTGA TGAGCAACTA AATGACTTAT ATCACAATCA 1320CCAGAGAACA GTAAGGGAGA AAGAAAGGAA ATTGGTAGAC TGTCATCGTG AACTGGAAAA 1380ACTAAATAAA GAATCTAGGC TTCTCAATCA GGAAAAATCA GAACTGCTTG TTGAACAGGG 1440TCGTCTACAG CTGCAAGCAG ATCGCCATCA AGAACATATC CGAGCTAGAG ATTCATTAAT 1500TCAGTCTTTG GCAACACAGC TAGAATTGGA TGGCTTTGAG CGTGGGCCAT TCAGTGAAAG 1560ACAGATTAAA AATTTTCACA AACTTGTGAG AGAGAGACAA GAAGGGGAAG CAAAAACTGC 1620CAACCAACTG ATGAATGACT TTGCAGAAAA AGAGACTCTG AAACAAAAAC AGATAGATGA 1680GATAAGAGAT AAGAAAACTG GACTGGGAAG AATAATTGAG TTAAAATCAG AAATCCTAAG 1740TAAGAAGCAG AATGAGCTGA AAAATGTGAA GTATGAATTA CAGCAGTTGG AAGGATCTTC 1800AGACAGGATT CTTGAACTGG ACCAGGAGCT CATAAAAGCT GAACGTGAGT TAAGCAAGGC 1860TGAGAAAAAC AGCAATGTAG AAACCTTAAA AATGGAAGTA ATAAGTCTCC AAAATGAAAA 1920AGCAGACTTA GACAGGACCC TGCGTAAACT TGACCAGGAG ATGGAGCAGT TAAACCATCA 1980TACAACAACA CGTACCCAAA TGGAGATGCT GACCAAAGAC AAAGCTGACA AAGATGAACA 2040AATCAGAAAA ATAAAATCTA GGCACAGTGA TGAATTAACC TCACTGTTGG GATATTTTCC 2100CAACAAAAAA CAGCTTGAAG ACTGGCTACA TAGTAAATCA AAAGAAATTA ATCAGACCAG 2160GGACAGACTT GCCAAATTGA ACAAGGAACT AGCTTCATCT GAGCAGAATA AAAATCATAT 2220AAATAATGAA CTAAAAAGAA GGGAAGAGCA GTTGTCCAGT TACGAAGACA AGCTGTTTGA 2280TGTTTGTGGT AGCCAGGATT TTGAAAGTGA TTTAGACAGG CTTAAAGAGG AAATTGAAAA 2340ATCATCAAAA CAGCGAGCCA TGCTGGCTGG AGCCACAGCA GTTTACTCCC AGTTCATTAC 2400TCAGCTAACA GACGAAAACC AGTCATGTTG CCCCGTTTGT CAGAGAGTTT TTCAGACAGA 2460GGCTGAGTTA CAAGAAGTCA TCAGTGATTT GCAGTCTAAA CTGCGACTTG CTCCAGATAA 2520ACTCAAGTCA ACAGAATCAG AGCTAAAAAA AAAGGAAAAG CGGCGTGATG AAATGCTGGG 2580ACTTGTGCCC ATGAGGCAAA GCATAATTGA TTTGAAGGAG AAGGAAATAC CAGAATTAAG 2640AAACAAACTG CAGAATGTCA ATAGAGACAT ACAGCGCCTA AAGAACGACA TAGAAGAACA 2700AGAAACACTC TTGGGTACAA TAATGCCTGA AGAAGAAAGT GCCAAAGTAT GCCTGACAGA 2760TGTTACAATT ATGGAGAGGT TCCAGATGGA ACTTAAAGAT GTTGAAAGAA AAATTGCACA 2820ACAAGCAGCT AAGCTACAAG GAATAGACTT AGATCGAACT GTCCAACAAG TCAACCAGGA 2880GAAACAAGAG AAACAGCACA AGTTAGACAC AGTTTCTAGT AAGATTGAAT TGAATCGTAA 2940GCTTATACAG GACCAGCAGG AACAGATTCA ACATCTAAAA AGTACAACAA ATGAGCTAAA 3000ATCTGAGAAA CTTCAGATAT CCACTAATTT GCAACGTCGT CAGCAACTGG AGGAGCAGAC 3060TGTGGAATTA TCCACTGAAG TTCAGTCTTT GTACAGAGAG ATAAAGGATG CTAAAGAGCA 3120GGTAAGCCCT TTGGAAACAA CATTGGAAAA GTTCCAGCAA GAAAAAGAAG AATTAATCAA 3180CAAAAAAAAT ACAAGCAACA AAATAGCACA GGATAAACTG AATGATATTA AAGAGAAGGT 3240TAAAAATATT CATGGCTATA TGAAAGACAT TGAGAATTAT ATTCAAGATG GGAAAGACGA 3300CTATAAGAAG CAAAAAGAAA CTGAACTTAA TAAAGTAATA GCTCAACTAA GTGAATGCGA 3360GAAACACAAA GAAAAGATAA ATGAAGATAT GAGACTCATG AGACAAGATA TTGATACACA 3420GAAGATACAA GAAAGGTGGC TACAAGATAA CCTTACTTTA AGAAAAAGAA ATGAGGAACT 3480AAAAGAAGTT GAAGAAGAAA GAAAACAACA TTTGAAGGAA ATGGGTCAAA TGCAGGTTTT 3540GCAAATGAAA AGTGAACATC AGAAGTTGGA AGAGAACATA GACAATATAA AAAGAAATCA 3600TAATTTGGCA TTAGGGCGAC AGAAAGGTTA TGAAGAAGAA ATTATTCATT TTAAGAAAGA 3660ACTTCGAGAA CCACAATTTC GGGATGCTGA GGAAAAGTAT AGAGAAATGA TGATTGTTAT 3720GAGGACAACA GAACTTGTGA ACAAGGATCT GGATATTTAT TATAAGACTC TTGACCAAGC 3780AATAATGAAA TTTCACAGTA TGAAAATGGA AGAAATCAAT AAAATTATAC GTGACCTGTG 3840GCGAAGTACC TATCGTGGAC AAGATATTGA ATACATAGAA ATACGGTCTG ATGCCGATGA 3900AAATGTATCA GCTTCTGATA AAAGGCGGAA TTATAACTAC CGAGTGGTGA TGCTGAAGGG 3960AGACACAGCC TTGGATATGC GAGGACGATG CAGTGCTGGA CAAAAGGTAT TAGCCTCACT 4020CATCATTCGC CTGGCCCTGG CTGAAACGTT CTGCCTCAAC TGTGGCATCA TTGCCTTGGA 4080TGAGCCAACA ACAAATCTTG ACCGAGAAAA CATTGAATCT CTTGCACATG CTCTGGTTGA 4140GATAATAAAA AGTCGCTCAC AGCAGCGTAA CTTCCAGCTT CTGGTAATCA CTCATGATGA 4200AGATTTTGTG GAGCTTTTAG GACGTTCTGA ATATGTGGAG AAATTCTACA GGATTAAAAA 4260GAACATCGAT CAGTGCTCAG AGATTGTGAA ATGCAGTGTT AGCTCCCTGG GATTCAATGT 4320TCATTAAAAA TATCCAAGAT TTAAATGCCA TAGAAATGTA GGTCCTCAGA AAGTGTATAA 4380TAAGAAACTT ATTTCTCATA TCAACTTAGT CAATAAGAAA ATATATTCTT TCAAAGGAAC 4440ATTGTGTCTA GGATTTTGGA TGTTGAGAGG TTCTAAAATC ATGAAACTTG TTTCACTGAA 4500AATTGGACAG ATTGCCTGTT TCTGATTTGC TGCTCTTCAT CCCATTCCAG GCAGCCTCTG 4560TCAGGCCTTC AGGGTTCAGC AGTACAGCCG AGACTCGACT CTGTGCCTCC CTCCCCAGTG 4620CAAATGCATG CTTCTTCTCA AAGCACTGTT GAGAAGGAGA TAATTACTGC CTTGAAAATT 4680TATGGTTTTG GTATTTTTTT AAATCATAGT TAAATGTTAC CTCTGAATTT ACTTCCTTGA 4740CATGTGGTTT GAAAAACTGA GTATTAATAT CTGAGGATGA CCAGAAATGG TGAGATGTAT 4800GTTTGGCTCT GCTTTTAACT TTATAAATCC AGTGACCTCT CTCTCTGGGA CTTGGTTTCC 4860CCAACTAAAA TTTGAAGTAG TTGAATGGGG TCTCAAAGTT TGACAGGAAC CTTAAGTAAT 4920CATCTAAGTC AGTACCCACC ACCTTCTTCT CCTACATATC CCTTCCAGAT GGTCATCCAG 4980ACTCAGAGCT CTCTCTACAG AGAGGAAATT CTCCACTGTG CACACCCACC TTTGGAAAGC 5040TCTGACCACT TGAGGCCTGA TCTGCCCATC GTGAAGAAGC CTGTAACACT CCTCTGCGTC 5100TATCCTGTGT AGCATACTGG CTTCACCATC AATCCTGATT CCTCTCTAAG TGGGCATTGC 5160CATGTGGAAG GCAAGCCAGG CTCACTCACA GAGTCAAGGC CTGCTCCCTG TAGGGTCCAA 5220CCAGACCTGG AAGAACAGGC CTCTCCATTT GCTCTTCAGA TGCCACTTCT AAGAAAAGCC 5280TAATCACAGT TTTTCCTGGA ATTGCCAGCT GACATCTTGA ATCCTTCCAT TCCACACAGA 5340ATGCAACCAA GTCACACGCT TTTGAATTAT GCTTTGTAGA GTTTTGTCAT TCAGAGTCAG 5400CCAGGACCAT ACCGGGTCTT GATTCAGTCA CATGGCATGG TTTTGTGCCA TCTGTAGCTA 5460TAATGAGCAT GTTTGCCTAG ACAGCTTTTC TCAACTGGGT CCAGAAGAGA ATTAAGCCCT 5520AAGGTCCTAA GGCATCTATC TGTGCTAGGT TAAATGGTTG GCCCCCAAAG ATAGACAGGT 5580CCTGATTTCT AGAACCCGTG ACTGTTACTT TATACAGCAA AGGAAACTTT GCAGATGTGA 5640TTAAAGCTAA GGACCTTAAG ACAGAGTATC CTGGGGGTGG TGGTGGGGTG GGGGGGGGTC 5700CTAAATGTAA TCACGAGTAA GATTAAGAGC CAATCAATTC TAGTCATATA TTAAACATCC 5760ACAATAACCA AGATATTTTT ATCCCAAGAA TGCAAGATTT CAGAAAATGA AAAATCTGTT 5820GATAAATCCA TCACTATAAT AAAACCGAAG GTGAAAAAAA TTCTGAAAAA AAAAAAAAAA 5880AAAAAAAAAA AAA 5893 472 base pairs nucleic acid double linear cDNA tomRNA NO NO G18.seq 55 GTGGAAGAAT GGTGAAATCA TTGATACTTT ACAACAAGTTTATGAGATCA ATGCCCCAAA 60 CAAATCAGCA GTTTACAAAT GGATAACTCA GTTTAAGAAGGGATGAGACG ATATTAAAGA 120 TGAAGCCCAC AGTGACAGAC TGTTCACATC AATTTGTGAGGAAAAAAATC ATCTTCTTCA 180 TGCCCTAACT GAAGAAGATC AATGATTAAC AGCAGAAACAATAGCCAACA CCATAGACAC 240 CTCAATTGAT TCAGGTTACA CAATTCTGAC TGAAAAATTAAAGTTGAGTA AACGTTCTAC 300 TTGATGGATG CCCAAATCAC TGCTTCCAGA TCAGCTGCAGACAACAGCAG AACTTCCTCA 360 ATAAGTGGGA TCAAGTTCCT AAAGCATTTC TTCAAAGAATTGTAACAGGA GGTGATGGAA 420 TGTGGCTTTA CCAGTACAAT CTTCAATTTG GCAAGTCTGTCCCTGGTCTG AT 472 1189 base pairs nucleic acid double linear cDNA tomRNA NO NO Tc1.seq 56 CCTGTGAACA TTGACAATAT ATTACTTTTA GTGGTACACAGTTCTTGAGA AAATGTCTTG 60 ATTTTTACAT TGCCATTTGT GATATTTTTA GCAGTCCACCACAATATCAT TTTTATAATA 120 AAAATAAAAT ATACTCATTG ATGATAGAGA AAATATTGTTAAAGACCTCT TGGGACAGGA 180 AAAGGCTCAG TCATAAAATC AGATGCTTAT TCATTTTCAGCTGTGTCATT TTGACTCATT 240 ACTTTCAAGA ATAACTATAA TATTGCTAGA CAGTTCATTACACTGAGAAG AACTTTCCTT 300 GAACTTCACA TGGAGATTGA GTAAAGCTCT TCTATTTGTTTTTTGAAGTA CTCTCTCAGC 360 TCAGGTCTCT TAGCTTTTAG TGTTGGTGTC AGCAAGCCATTTTGAACTGA GAACATGTCA 420 GAATGGATGT GAATGGCTTT AACCTGCTCA AAAGAATGGAGTCCACTTTC TTTTCCTAAC 480 CTCACCATAT CTTCCAAAAT GGCTTTCTTC AGATCCTTATTTGTGCAGAG ATCTGCATAT 540 GTTCCTTCAA TTCCTCTCTT CTGGGCCCAG GAGGGCATAACTTCAGGGTC AGGCACAACA 600 ATGCCTACCA AAAAGGCCTT TAAGCTGTCC CCATGGACATAGATTTGCGC CACAGGTTGG 660 CTCCGGATGT AGATGTTCTC AATCTTCTCG GGTGCAACATATTCTCCCTG AGCAAGTTTA 720 AATATATGCT TTTTCCGATC AATAATTTTA AGAGTTCCTGCCGGCAGCCA TTTTCCGATG 780 TCTCCAGTGT GAAGCCAGCC ATCGCTGTCC AGGGCCTCCTTCGTCCTGTC TGGATCTTTC 840 AAGTAGCCTC TGAACACATT TGGTCCTCTC ACACATATCTCTCCCTCTCC TTTGCAGGCC 900 CAGTAGTTCA GTTCCTCAAC ATCAACGAGC TTGATATGATTGCAGGGAAG TGGCGCCCCT 960 ACGTGCCCTG AGGTCCAGTC GCCAGGAGTG GTGAAGGTACATCCAGCTGT GCACTCAGTT 1020 TGGCCATAAC CTTCATAAAC CTGGCACCCT AGAGCTGCCCGGAGAAATCC CAGAACTGTT 1080 GGTGATGCTG GGGCTGCTCC AGTAACAATC ATCCGCACAAGTCACCTAGC TCAGGCAGTC 1140 GCAGTATCCT CAGGATCTCC TGTGTCTGCA TCTTCTCAGAAGTGAGAGG 1189 987 base pairs nucleic acid double linear cDNA to mRNA NONO Tc2.seq 57 CCTCTCACTT CTGAGAAGAT GCAGACACAG GAGATCCTGA GGATACTGCGACTGCCTGAG 60 CTAGGTGGCT TGGGACAGTT TTTCCGCAGC CTCTCGGCCA CCACCCTCGTGAGTATGGGT 120 GCCCTGGCTG CCATCCTTGC CTACTGGTTC ACTCACCGGC CAAAGGCCTTGCAGCCGCCA 180 TGCAACCTCC TGATGCAGTC AGAAGAAGTA GAGGACAGTG GCGGGGCACGGCGATCTGTG 240 ATTGGGTCTG GCCCTCAGCT ACTTACCCAC TACTATGATG ATGCCCGGATCATGTACCTG 300 GTGTCCCGCC GTGGGCTTAG CATCTCAGGG AATGGGCCCT GTCTTGGTTTCAGGAAGCCT 360 AAGCAGCCTT ACCAGTGGCT GTCCTACCAG GAGGTGGCCG ACAGGGCTGAATTTCTGGGG 420 TCCGGACTTC TCCAGCACAA TTGTAAAGCA TGCACTGTCA GTTTATTGGTGTTTTTGCAC 480 AAAATCGGCC AGAGTGGATC ATTGTGGAGC TGGCCTGCTA CACATATTCCATTCTTTTGA 540 GCAGGTTAAG GCCATTTACA TCCATTCTGA CATGTTCTCA GTTCAAAATGGCTTGCTGAC 600 ACCAACACTA AAAGCTAAGA GACCTGAGCT GAGAGAGTAC TTCAAAAAACAAATAGAAGA 660 GCTTTACTCA ATCTCCATGT GAAGTTCAAG GAAAGTTCTT CTCAGTGTAATAAACTGTCT 720 AGCAATATTA TAGTTATTCT TGAAAGTAAT GAGTCAAAAT GACACAACTGAAAATGAATA 780 AGCATCTGAT TTTATGACTG AGCCTTTTCC TGTCCCATGA GGTCTTTAACAATATTTTCT 840 CTATCATCAA TGAGTATATT TTATTTTTAT TATAAAAATG ATATTGTGGTGGACTGCTAA 900 AAATATCACA AGTGGCAATG TAAAAATCAA GACATTTTCT CAAGAACTGTGTACCACTAA 960 AAGTAATATA TTGTCAATGT TCACAGG 987 691 base pairs nucleicacid double linear cDNA to mRNA NO NO Tc3.seq 58 CCTGTGAACA TTGACAATATATTACTTTTA GTGGTACACA GTTCTTGAGA AAATGTCTTG 60 ATTTTTACAT TGCCATTTGTGATATTTTTA GCAGTCCACC ACAATATCAT TTTTATAATA 120 AAAATAAAAT ATACTCATTGATGATAGAGA AAATATTGGA GGATCCAGAA TTCTCGAGTT 180 GCCTCCTTTT TTGGCAGACTTCATCTTCTC ATCTCCCAAA CCCCCTGAGC CCGTAGGGTT 240 TTCATAGTGG ACAAAGAACTTGTGGTCTTT TAAAACTGGG ACTGATACTT TTTTGAGAGA 300 GTATCGTGTC GAAAGTGTGATGTTCTACCA CTTTACCAAT AACTAATTTT AAATACACAT 360 TGTCCTCTCG ATTTTTGGACCAAACAGACG CTCACAGTGG AGGCTTATCA AGGGTTGCAT 420 TGGGGAAGAA GCCTCTCCCTCTCTGTCAGC ACCAGCTGGT AAAGGTGACT GTACAGATGT 480 GCATTTTCCT TTTGGTATAAATGGTCCACA GCACTAACTG GTAAGGCTTA TTGTGCAGTA 540 TATTGTCAGT ATTCTTCTGGTTCAGCATGC CTTATAGTTC ANATATAACC TGTATTAANT 600 GTATAGATTG TGCAGTAAAAGCTGTTACCA AGTTGTCAGA ACATAAGAGC GAAAACAAGG 660 TCATATGTAA TATATTGTCAATGTTCACAG G 691 2511 base pairs nucleic acid double linear cDNA to mRNANO NO TcA.seq 59 CCTCTCACTT CTGGAGAAGA TGCAGACACA GGAGATCCTG AGGATACTGCGACTGCCTGA 60 GCTAGGTGAC TTGGGACAGT TTTTCCGCAG CCTCTCGGCC ACCACCCTCGTGAGTATGGG 120 TGCCCTGGCT GCCATCCTTG CCTACTGGTT CACTCACCGG CCAAAGGCCTTGCAGCCGCC 180 ATGCAACCTC CTGATGCAGT CAGAAGAAGT AGAGGACAGT GGCGGGGCACGGCGATCTGT 240 GATTGGGTCT GGCCCTCAGC TACTTACCCA CTACTATGAT GATGCCCGGACCATGTACCA 300 GGTGTTCCGC CGTGGGCTTA GCATCTCAGG GAATGGGCCC TGTCTTGGTTTCAGGAAGCC 360 TAAGCAGCCT TACCAGTGGC TGTCCTACCA GGAGGTGGCC GACAGGGCTGAATTTCTGGG 420 GTCCGGACTT CTCCAGCACA ATTGTAAAGC ATGCACTGAT CAGTTTATTGGTGTTTTTGC 480 ACAAAATCGG CCAGAGTGGA TCATTGTGGA GCTGGCCTGC TACACATATTCCATGGTGGT 540 GGTCCCGCTC TATGACACCC TGGGCCCTGG GGCTATCCGC TACATCATCAATACAGGGCT 600 CAGCTGCCAA GAAGGAGCCT CTGCAACAGC CTCCACACAG GGTACAGCCCTCTGAAGTTC 660 ATGACAGCTT GGCACAGATG CAGGGGGTGC GGACATCAGC ACCGTGATTGTGGACAAACC 720 TCAGAAGGCT GTGCTTCTGC TAGAGCATGT GGAGAGGAAG GAGACTCCAGGCCTCAAGCT 780 GATCATCCTC ATGGACCCAT TCGAAGAAGC CCTGAAAGAG AGAGGGCAGAAGTGCGGGGT 840 GGTCATTAAG TCCATGCAGG CCGTGGAGGA CTGTGGCCAA GAGAATCACCAGGCTCCTGT 900 GCCCCCGCAG CCTGATGACC TCTCCATTGT GTGTTTCACA AGCGGCACGACAGGGAACCC 960 AAAAGGTGCG ATGCTCACCC ATGGGAACGT GGTGGCTGAT TTCTCAGGCTTTCTGAAAGT 1020 GACAGAGAGT CAGTGGGCTC CCACTTGTGC GGATGTGCAC ATTTCCTAGTTGCCTTTAGC 1080 ACACATGTTT GAGCGAATGG TGCAGTCTGT CGTCTATTGC CACGGAGGGCGTGTTGGCTT 1140 CTTCCAGGGA GATATCCGCC TTCTCTCAGA TGACATGAAG GCTCTATGCCCCACCATCTT 1200 CCCTGTGGTC CCACGACTGC TGAACCGGAT GTACGACAAG ATCTTCAGCCAGGCAAACAC 1260 ACCATTAAAG CGCTGGCTCC TGGAGTTTGC AGCAAAACGT AAGCAAGCCGAGGTCCGGAG 1320 TGGAATCATC AGGAATGATA GTATCTGGGA TGAACTCTTC TTTAATAAGATTCAGGCCAG 1380 TCTTGGTGGG TGTGTGCGGA TGATTGTTAC TGGAGCAGCC CCAGCATCACCAACAGTTCT 1440 GGGATTTCTC CGGGCAGCTC TAGGGTGCCA GGTTTATGAA GGTTATGGCCAAACTGAGTG 1500 CACAGCTGGA TGTACCTTCA CCACTCCTGG CGACTGGACC TCAGGGCACGTAGGGGCGCC 1560 ACTTCCCTGC AATCATATCA AGCTCGTTGA TGTTGAGGAA CTGAACTACTGGGCCTGCAA 1620 AGGAGAGGGA GAGATATGTG TGAGAGGACC AAATGTGTTC AAAGGCTACTTGAAAGATCC 1680 AGACAGGACG AAGGAGGCCC TGGACAGCGA TGGCTGGCTT CACACTGGAGACATCGGAAA 1740 ATGGCTGCCG GCAGGAACTC TTAAAATTAT TGATCGGAAA AAGCATATATTTAAACTTGC 1800 TCAGGGAGAA TATGTTGCAC CCGAGAAGAT TGAGAACATC TACATCCGGAGCCAACCTGT 1860 GGCGCAAATC TATGTCCATG GGGACAGCTT AAAGGCCTTT TTGGTAGGCATTGTTGTGCC 1920 TGACCCTGAA GTTATGCCCT CCTGGGCCCA GAAGAGAGGA ATTGAAGGAACATATGCAGA 1980 TCTCTGCACA AATAAGGATC TGAAGAAAGC CATTTTGGAA GATATGGTGAGGTTAGGAAA 2040 AGAAAGTGGA CTCCATTCTT TTGAGCAGGT TAAAGCCATT CACATCCATTCTGACATGTT 2100 CTCAGTTCAA AATGGCTTGC TGACACCAAC ACTAAAAGCT AAGAGACCTGAGCTGAGAGA 2160 GTACTTCAAA AAACAAATAG AAGAGCTTTA CTCAATCTCC ATGTGAAGTTCAAGGAAAGT 2220 TCTTCTCAGT GTAATGAACT GTCTAGCAAT ATTATAGTTA TTCTTGAAAGTAATGAGTCA 2280 AAATGACACA GCTGAAAATG AATAAGCATC TGATTTTATG ACTGAGCCTTTTCCTGTCCC 2340 AAGAGGTCTT TAACAATATT TTCTCTATCA TCAATGAGTA TATTTTATTTTTATTATAAA 2400 AATGATATTG TGGTGGACTG CTAAAAATAT CACAAATGGC AATGTAAAAATCAAGACATT 2460 TTCTCAAGAA CTGTGTACCA CTAAAAGTAA TATATTGTCA ATGTTCACAG G2511 2416 base pairs nucleic acid double linear cDNA to mRNA NO NOTcB.seq 60 CCTCTCACTT CTGGAGAAGA TGCAGACACA GGAGATCCTG AGGATACTGCGACAGCCTGA 60 GCTAGGTGAC TTGGGACAGT TTTTCCGCAG CCTCTCGGCC ACCACCCTCGTGAGTATGGG 120 TGCCCTGGCT GCCATCCTTG CCTACTGGTT CACTCACCGG CCAAAGGCCTTGCAGCCGCC 180 ATGCAACCTC CTGATGCAGT CAGAAGAGGT AGAGGACAGT GGCGGGGCACGGCGATCTGT 240 GATTGGGTCT GGCCCTCAGC TACTTACCCA CTACTATGAT GATGCCCGGACCATGTACCA 300 GGTGTTCCGC CGTGGGCTTA GCATCTCAGG GAATGGGCCC TGTCTTGGTTTCAGGAAGCC 360 TAAGCAGCCT TACCAGTGGC TGTCCTACCA GGAGGTGGCC GACAGGGCTGAATTTCTGGG 420 GTCCGGACTT CTCCAGCACA ATTGTAAAGC ATGCACTGAT CAGTTTATTGGTGTTTTTGC 480 ACAAAATCGG CCAGAGTGGA TCATTGTGGA GCTGGCCTGC TACACATATTCCATGGTGGT 540 GGTCCCGCTC TATGACACCC TGGGCCCTGG GGCTATCCGC TACATCATCAATACAGCGGA 600 CATCAGCACC GTGATTGTGG ACAAACCTCA GAAGGCTGTG CTTCTGCTAGAGCATGTGGA 660 GAGGAAGGAG ACTCCAGGCC TCAAGCTGAT CATCCTCATG GACCCATTCGAAGAAGCCCT 720 GAAAGAGAGA GGGCAGAAGT GCGGGGTGGT CATTAAGTCC ATGCAGGCCGTGGAGGACTG 780 TGGCCAAGAG AATCACCAGG CTCCTGTGCC CCCGCAGCCT GATGACCTCTCCATTGTGTG 840 TTTCACAAGC GGCACGACAG GGAACCCAAA AGGTGCGATG CTCACCCATGGGAACGTGGT 900 GGCTGATTTC TCAGGCTTTC TGAAAGTGAC AGAGAGTCAG TGGGCTCCCACTTGTGCGGA 960 TGTGCACACT TCCTATTTGC CTTTAGCACA CATGTTTGAG CGAATGGTGCAGTCTGTCGT 1020 CTATTGCCAC GGAGGGCGTG TTGGCTTCTT CCAGGGAGAT ATCCGCCTTCTCTCAGATGA 1080 CATGAAGGCT CTATGCCCCA CCATCTTCCC TGTGGTCCCA CGACTGCTGAACCGGATGTA 1140 CGACAAGATC TTCAGCCAGG CAAACACACC ATTAAAGCGC TGGCTCCTGGAGTTTGCAGC 1200 AAAGCGTAAG CAAGCCGAGG TCCGGAGTGG AATCATCAGG AATGATAGTATCTGGGATGA 1260 ACTCTTCTTT AATAAGATTC AGGCCAGTCT TGGTGGGTGT GTGCGGATGATTGTTACTGG 1320 AGCAGCCCCA GCATCACCAA CGGTTCTGGG ATTTCTCCGG GCAGCTCTAGGGTGCCAGGT 1380 TTATGAAGGT TATGGCCAAA CTGAGTGCAC AGCTGGATGT ACCTTCACCACTCCTGGCGA 1440 CTGGACCTCA GGGCACGTAG GGGCGCCACT TCCCTGCAAT CATATCAAGCTCGTTGATGT 1500 TGAGGAACTG AACTACTGGG CCTGCAAAGG AGAGGGAGAG ATATGTGAGAGGACCAAATG 1560 TGTTCAAAGG CTACTTGAAA GATCCAGACA GGACGAAGGA GGCCCTGTACGGCGATGGCT 1620 GGCTTCACAC TGGAGACATC GGTAAATGGC TGCCGGCAGG AACTCTTAAAATTATTGATC 1680 GGAAAAAGCA TATATTTAAA CTTGCTCAGG GAGTATATGT TGCACCCGAGAAGATTGAGA 1740 ACATCTACAT CCGGAGCCAA CCTGTGGCGC AAATCTATGT CCATGGGGACAGCTTAAAGG 1800 CCTTTTTGGT AGGCATTGTT GTGCCTGACC CTGAAGTTAT GCCCTCCTGGGCCCAGAAGA 1860 GAGGAATTGA AGGAACATAT GCAGATCTCT GCACAAATAA GGATCTGAAGAAAGCCATTT 1920 TGGAAGATAT GGTGAGGTTA GGAAAAGAAA GTGGACTCCA TTCTTTTGAGCAGGTTAAAG 1980 CCATTCACAT CCATTCTGAC ATGTTCTCAG TTCAAAATGG CTTGCTGACACCAACACTAA 2040 AAGCTAAGAG ACCTGAGCTG AGAGAGTACT TCAAAAAACA AATAGAAGAGCTTTACTCAA 2100 TCTCCATGTG AAGTTCAAGG AAAGTTCTTC TCAGTGTAAT GAACTGTCTAGCAATATTAT 2160 AGTTATTCTT GAAAGTAATG AGTCAAAATG ACACAGCTGA AAATGAATAAGCATCTGATT 2220 TTATGACTGA GCCTTTTCCT GTCCCAAGAG GTCTTTAACA ATATTTTCTCTATCATCAAT 2280 GAGTATATTT TATTTTTATT ATAAAAATGA TATTGTGGTG GACTGCTAAAAATATCACAA 2340 ATGGCAATGT AAAAATCAAG ACATTTTCTC AAGAACTGTG TACCACTAAAAGTAATATAT 2400 TGTCAATGTT CACAGG 2416 2416 base pairs nucleic aciddouble linear cDNA to mRNA NO NO TS.seq 61 CCTGTGAACA TTGACAATATATTACTTTTA GTGGTACGCA GTTCTTGAGA AAATGTCTTG 60 ATTTTTACAT TGCCATTTGTGATATTTTTA GCAGTCCACC ACAATATCAT TTTTATAATA 120 AAATAAAATA TACTCATTGATGATAGAGAA AATATTGTTA AAGACCTCTT GGGACAGGAA 180 AAGGCTCAGT CATAAAATCAGATGCTTATT CATTTTCAGC TGTGTCATTT TGACTCATTA 240 CTTTCAAGAA TAACTATAATATTGCTAGAC AGTTCATTAC ACTGAGAAGA ACTTTCCTTG 300 AACTTCACAT GGAGATTGAGTAAAGCTCTT CTATTTGTTT TTTGAAGTAC TCTCTCAGCT 360 CAGGTCTCTT AGCTTTTAGTGTTGGTGTCA GCAAGCCATT TTGAACTGAG AACATGTCAG 420 AATGGATGTG AATGGCTTTAACCTGCTCAA AAGAATGGAG TCCACTTTCT TTTCCTAACC 480 TCACCATATC TTCCAAAATGGCTTTCTTCA GATCCTTATT TGTGCAGAGA TCTGCATATG 540 TTCCTTCAAT TCCTCTCTTCTGGGCCCAGG AGGGCATAAC TTCAGGGTCA GGCACAACAA 600 TGCCTACCAA AAAGGCCTTTAAGCTGTCCC CATGGACATA GATTTGCGCC ACAGGTTGGC 660 TCCGGATGTA GATGTTCTCAATCTTCTCGG GTGCAACATA TTCTCCCTGA GCAAGTTTAA 720 ATATATGCTT TTTCCGATCAATAATTTTAA GAGTTCCTGC CGGCAGCCAT TTTCCGATGT 780 CTCCAGTGTG AAGCCAGCCATCGCTGTCCA GGGCCTCCTT CGTCCTGTCT GGATCTTTCA 840 AGTAGCCTTT GAACACATTTGGTCCTCTCA CACATATCTC TCCCTCTCCT TTGCAGGCCC 900 AGTAGTTCAG TTCCTCAACATCAACGAGCT TGATATGATT GCAGGGAAGT GGCGCCCCTA 960 CGTGCCCTGA GGTCCAGTCGCCAGGAGTGG TGAAGGTACA TCCAGCTGTG CACTCAGTTT 1020 GGCCATAACC TTCATAAACCTGGCACCCTA GAGCTGCCCG GAGAAATCCC AGAACTGTTG 1080 GTGATGCTGG GGCTGCTCCAGTAACAATCA TCCGCACACA CCCACCAAGA CTGGCCTGAA 1140 TCTTATTAAA GAAGAGTTCATCCCAGATAC TATCATTCCT GATGATTCCA CTCCGGACCT 1200 CGGCTTGCTT ACGCTTTGCTGCAAACTCCA GGAGCCAGCG CTTTAATGGT GTGTTTGCCT 1260 GGCTGAAGAT CTTGTCGTACATCCGGTTCA GCAGTCGTGG GACCACAGGG AAGATGGTGG 1320 GGCATAGAGC CTTCATGTCATCTGAGAGAA GGCGGATATC TCCCTGGAAG AAGCCAACAC 1380 GCCCTCCGTG GCAATAGACGACAGACTGGA TTACTCTCTC AAACATGTGA GCCAGAGGCA 1440 GGGAGGAGAT GAGCACATCGTCCTGTCTCG GAAAGATCAC TTTCTCTGTC ACTTTCAGAA 1500 AGCCTGAGAA ATCGGCCACCACGTTCCCAT GGGTAAGCAT CGCACCTTTT GGGTTCCCTG 1560 TCGTGCCGCT TGTGAAACACACAATGGAGA GGTCATCAGG CTGCGGGGGC ACAGGAGCCT 1620 GGTGATTCTC TTGGCCACAGTCCTCCACGG CCTGCATGGA CTTAATGACC ACCCCGCACT 1680 TCTGCCCTCT CTCTTTCAGGGCTTCTTCGA ATGGGTCCAT GAGGATGATC AGCTTGAGGC 1740 CTGGAGTCTC CTTCCTCTCCACATGCTCTA GCAGAAGCAC AGCCTTCTGA GGTTTGTCCA 1800 CAATCACGGT GCTGATGTCCGCTGTATTGA TGATGTGGCG GATAGCCCCA GGGCCCAGGG 1860 TGTCATAGAG CGGGACCACCACCATGGAAT ATGTGTAGCA GGCCAGCTCC ACGATGATCC 1920 ACTCTGGCCG ATTTTGTGCAAAAACACCAA TAAACTGATC AGTGCATGCT TTACAATTGT 1980 GCTGGAGAAG TCCGGACCCCAGAAATTCAG CCCTGTCGGC CACCTCCTGG TAGGACAGCC 2040 ACTGGTAAGG CTGCTTAGGCTTCCTGAAAC CAAGACAGGG CCCATTCCCT GAGATGCTAA 2100 GCCCACGGCG GAACACCTGGTACATGGTCC GGGCATCATC ATAGTAGTGG GTAAGTAGCT 2160 GAGGGCCAGA CCCAATCACAGATCGCCGTG CCCCGCCACT GTCCTCTACT TCTTCTGACT 2220 GCATCAGGAG GTTGCATGGCGGCTGCAAGG CCTTTGGCCG GTGAGTGAAC CAGTAGGCAA 2280 GGATGGCAGC CAGGGCACCCATACTCACGA GGGTGGTGGC CGAGAGGCTG CGGAAAAACT 2340 GTCCCAAGTC ACCTAGCTCAGGCAGTCGCA GTATCCTCAG GATCTCCTGT GTCTGCATCT 2400 TCTCAGAAGT GAGAGG 24161698 base pairs nucleic acid double linear cDNA to mRNA NO NO TS2.seq 62CCTGTGAACA TTGACAATAT ATTACTTTTA GTGGTACACA GTTCTTGAGA AAATGTCTTG 60ATTTTTACAT TGCCATTTGT GATATTTTTA GCAGTCCACC ACAATATCAT TTTTATAATA 120AAAATAAAAT ATACTCATTG ATGATAGAGA AAATATTGTT AAAGACCTCT TGGGACAGGA 180AAAGGCTCAG TCATAAAATC AGATGCTTAT TCATTTTCAG CTGTGTCATT TTGACTCATT 240ACTTTCAAGA ATAACTATAA TATTGCTAGA CAGTTCATTA CACTGAGAAG AACTTTCCTT 300GAACTTCACA TGGAGATTGA GTAAAGCTCT TCTATTTGTT TTTTGAAGTA CTCTCTCAGC 360TCAGGTCTCT TAGCTTTTAG TGTTGGTGTC AGCAAGCCAT TTTGAACTGA GAACATGTCA 420GAATGGATGT GAATGGCTTT AACCTGCTCA AAAGAATGGA GTCCACTTTC TTTTCCTAAC 480CTCACCATAT CTTCCAAAAT GGCTTTCTTC AGATCCTTAT TTGTGCAGAG ATCTGCATAT 540GTTCCTTCAA TTCCTCTCTT CTGGGCCCAG GAAGGCATAA CTTCAGGGTC AGGCACAACA 600ATGCCTACCA AAAAGGCCTT TAAGCTGTCC CCATGGACAT AGATTTGCGC CACAGGTTGG 660CTCCGGATGT AGATGTTCTC AATCTTCTCG GGTGCAACAT ATTCTCCCTG AGCAAGTTTA 720AATATATGCT TTTTCCGATC AATAATTTTA AGAGTTCCTG CCGGCAGCCA TTTTCCGATG 780TCTCCAGTGT GAAGCCAGCC ATCGCTGTCC AGGGCCTCCT TCGTCCTGTC TGGATCTTTC 840AAGTAGCCTT TGAACACATT TGGTCCTCTC ACACATATCT CTCCCTCTCC TTTGCAGGCC 900CAGTAGTTCA GTTCCTCAAC ATCAACGAGC TTAATATGAT TGCAGGGAAG TGGCGCCCCT 960ACGTGCACTG AGGTCCAGTC GCCAGGAGTG GTGAAGGTAC ATCCAGCTGT GCACTCAGTT 1020TGGCCATAAC CTTCATAAAC CTGGCACCCT AGAGCTGCCC GGAGAAATCC CAGAACTGTT 1080GGTGATGCTG GGGCTGCTCC AGTAACAATC ATCCGCACAC ACCCACCAAG ACTGGCCTGA 1140ATCTTATTAA AGAAGAGTTC ATCCCAGATA CTATCATTCC TGATGATTCC ACTCCGGACC 1200TCGGCTTGCT TACGCTTTGC TGCAAACTCC AGGAGCCAGC GCTTTAATGG TGTGTTTGCC 1260TGGCTGAAGA TCCTGTCGTA CATCCGGTTC AGCAGTCGTG GGACCACAGG GAAGATGGTG 1320GGGCATAGAG CCTTCATGTC ATCTGAGAGA AGGCGGATAT CTCCCTGGAA GAAGCCAACA 1380CGCCCTCCGT GGCAATAGAC GACAGACTGG ATTACTCTCT CAAACATGTG AGCCAGAGGC 1440AGGAAGGAGA TGAGCACATC GTCCTGTCTC GGAAAGATCA CTTTCTCTAC TTCTTCTGAC 1500TGCATCAGGA GGTTGCATGG CGGCTGCAAG GCCTTTGGCC GGTGAGTGAA CCAGTAGGCA 1560AGGATGGCAG CCAGGGCACC CATACTCACG AGGGTGGTGG CCGAGAGGCT GCGGAAAAAC 1620TGTCCCAAGT CACCTAGCTC AGGCAGTCGC AGTATCCTCA GGATCTCCTG TGTCTGCATC 1680TTCTCCAGAA GTGAGAGG 1698 2416 base pairs nucleic acid double linear cDNAto mRNA NO NO FL.seq 63 CCTGTGAACA TTGACAATAT ATTACTTTTA GTGGTACACAGTTCTTGAGA AAATGTCTTG 60 ATTTTTACAT TGCCATCTGT GATATTTTTA GCAGTCCACCACAATATCAT TTTTATAATA 120 AAAATAAAAT ATACTCATTG ATGATAGAGA AAATATTGTTAAAGACCTCT TGGGACAGGA 180 AAAGGCTCAG TCATAAAATC AGATGCTTAT TCATTTTCAGCCGTGTCATT TTGACTCATT 240 ACTTTCAAGA ATAACTATAA TATTGCTAGA CAGTTCATTACACTGAGAAG AACTTTCCTT 300 GAACTTCACA TGGAGATTGA GTAAAGCTCT TCTATTTGTTTTTTGAAGTA CTCTCTCAGC 360 TCAGGTCTCT TAGCTTTTAG TGTTGGTGTC AGCAAGCCATTTTGAACTGA GAACATGTCA 420 GAATGGATGT GAATGGCTTT AACCTGCTCA AAGAATGGAGTCCACTTTCT TTTCCTAACC 480 TCACCATATC TTCCAAAATG GCTTTCTTCA GATCCTTATTTGTGCAGAGA TCTGCATATG 540 TTCCTTCAAT TCCTCTCTTC TGGGCCCAGG AGGGCATAACTTCAGGGTCA GGCACAACAA 600 TGCCTACCAA AAAGGCCTTT AAGCTGTCCC CATGGACATAGATTTGCGCC ACAGGTTGGC 660 TCCGGATGTA GATGTTCTCA ATCTTCTCGG GTGCAACATATTCTCCCTGA GCAAGTTTGA 720 ATATATGCTT TTTCCGATCA ATAATTTTAA GAGTTCCTGCCGGCAGCCAT TTTCCGATGT 780 CTCCAGTGTG AAGCCAGCCA TCGCTGTCCA GGGCCTCCTTCGTCCTGTCT GGATCTTTCA 840 AGTAGCCTTT GAACACATTT GGTCCTCTCA CACATATCTCTCCCTCTCCT TTGCAGGCCC 900 AGTAGTTCAG TTCCTCAACA TCAACGAGCT TGATATGATTGCAGGGAAGT GGCGCCCCTA 960 CGTGCCCTGA GGTCCAGTCG CCAGGAGTGG TGAAGGTACATCCAGCTGTG CACTCAGTTT 1020 GGCCATAACC TTCATAAACC TGGCACCCTA GAGCTGCCCGGAGAAATCCC AGAACTGTTG 1080 GTGATGCTGG GGCTGCTCCA GTAACAATCA TCCGCACACACCCACCAAGA CTGGCCTGAA 1140 TCTTATTAAA GAAGAGTTCA TCCCAGATAC TATCATTCCTGATGATTCCA CTCCGGACCT 1200 CGGCTTGCTT ACGCTTTGCT GCAAACTCCA GGAGCCAGCGCTTTAATGGT GTGTTTGCCT 1260 GGCTGAAGAT CTTGTCGTAC ATCCGGTTCA GCAGTCGTGGGACCACAGGG AAGATGGTGG 1320 GGCATAGAGC CTTCATGTCA TCTGAGAGAA GGCGGATATCTCCCTGGAAG AAGCCAACAC 1380 GCCCTCCGTG GCAATAGACG ACAGACTGCA CCATACGCTCAAACATGTGT GCTAAAGGCA 1440 AATAGGAAAT GTGCACATCC GCACAAGTGG GAGCCCACTGACTCTCTGTC ACTTTCAGAA 1500 AGCCTGAGAA ATCAGCCACC ACGTTCCCAT GGGTGAGCATCGCACCCTTT GGGTTCCCTG 1560 TCGTGCCGCT TGTGAAACAC ACAATGGAGA GGTCATCAGGCTGCGGGGGC ACAGGAGCCT 1620 GGTGATTCTC TTGGCCACAG TCCTCCACGG CCTGCATGGACTTGATGACC ACCCCGCACT 1680 TCTGCCCTCT CTCTTTCAGG GCTTCTTCGA ATGGGTCCATGAGGATGATC AGCTTGAGGC 1740 CTGGAGTCTC CTCCCTCTCC ACATGCTCTA GCAGAAGCACAGCCTTCTGA GGTTTGTCCA 1800 CAATCACGGT GCTGATGTCC GCTGTATTGA TGATGTAGCGGATAGCCCCA GGGCCCAGGG 1860 TGTCATAGAG CGGGACCACC ACTATGGAAT ATGTGTAGCAGGCCAGCTCC ACAATGATCC 1920 ACTCTGGCCG ATTTTGTGCA AAAACACCAA TAAACTGATCAGTGCATGCT TTACAATTGT 1980 GCTGGAGAAG TCCGGACCCC AGAAATTCGG CCCTGTCGGCCACCTCCTGG TAGGACAGCC 2040 ACTGGTAAGG CTGCTTAGGC TTCCTGAAAC CAAGACAGGGCCCATTCCCT GAGATGCTAA 2100 GCCCACGGCG GAACACCTGG TACATGGTCC GGGCATCATCATAGTAGTGG GTAAGTAGCT 2160 GAGGGCCAGA CCCAATCACA GATCGCCGTG CCCCGCCACTGTCCTCTACT TCTTCTGACT 2220 GCATCGGGAG GTTGCATGGC GGCTGCAAGG CCTTTGGCCGGTGGGTGAGC CAGTAGGCAA 2280 GGATGGCAGC CAGGGCACCC ATACTCACGA GGGTGGTGGCCGAGAGGCTG CGGAAAAACT 2340 GTCCCAAGTC ACCTAGCTCA GGCAGTCGCA GTATCCTCAGGATCTCCTGT GTCTGCATCT 2400 TCTCAGAAGT GAGAGG 2416 406 base pairs nucleicacid double linear cDNA to mRNA NO NO FL2.seq 64 TGGTGTCAGG GGNCAACAGAGGCCGAAGGC GCCCTCTTGA AAAAAAATAA GCTACAAGAT 60 GAGTAGAGTG GTTTACACAGAGGACTGTGG AGGTGGTGGG TAATAAACTT AAGCACCAGT 120 TTTAATCAAG TACGGGCTGGATAATTAGAC AAGATATNGG NNNCTGAGCC TCGCGTCAAC 180 TGAATCGGCA GCTCGGCCGCCTGTTGCCAC AGGCTCCTTT CTCCACGGCG TCCTTGCGGG 240 ACCGCCAGAG TGTGCTTGGCTTCCGCGTAT CCGTGTGTCT GCGCGTCGCC GGCGACTGTC 300 CCGTGTTTCC CTGTGAGGCTGCCACGCCCA GGCGTGCATG TGCGTCTCGA GACCTGTGGA 360 CCTGGGCGGC AGAAAGGCTTCCCGTGGCTG TTGCCCGCTG ACACCA 406 234 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G205a.seq 65 CAGGCTCATG GTGTGGAAGT CAGACCGGGAGTCTCCTGGA GCAGACTCAC AGTGTAGGGG 60 GTCAGCAGAG GCAGCAGCTT TGGGAATCCCGGCACTGCAG CCTCAGGGGT NGGCTCGCTG 120 AGTGGGTCAA GGTCTTTAGG GTTCTTGGGCCCAGCCTTGG AGCCTGCCCT CCCAGCCCTC 180 CTGACATTCT TAGAAGCACC TACTTTCCTGCCTGAAATCC TTTCCTGATT TAAA 234 161 base pairs nucleic acid double linearcDNA to mRNA NO NO G205b.seq 66 GCTGTATAGT ATTCCACTGT GTATATATAGTTTTGGATCT TATCGCAGTG CCTCAAGTTC 60 TGTGAAGGAG AGAATCTGGA TAATTGTATCAGGAGGTCCT TAGACCATAT TTAGGATCCT 120 TCCATTGGGA CTTGGGCAGC AAGGTTACCAAAACTTAAAT G 161 445 base pairs nucleic acid double linear cDNA to mRNANO NO G205c.seq 67 CTCTCCTTCT GCATCCCCGA CTCTCCTTGA GAACCTATTTGGCAGAAGCT CTCCACCCAG 60 CAAGTCCGCA GCTTGATGAG CTCCCTCCTG TGTTAACTGGAACCCCTGCT GTACTTCATT 120 CCACATAATA GTTCATCGGA TCCAAAGTCC CCACCTGCTTTGGAAGCAAC CACCTGCTCT 180 TCTCATAACT CTCCTCCAGT TTGTGCAGTG AAGAATCAACCTTTATCCAA GAAGTCTGGC 240 CTTTGCCCTG GCTCTTGGGA GGTCCTACCA GCTACAAACCCTTGGAGTAA ACAACGTGGC 300 TAGTCCTTGT CACCAGTTCC CAGGAGGTAG CCCCAAATTCCTAGGGATTT CCCAAGTGAT 360 AGGAGTATCT TATTACTCAT GGTGGTCTCT GAGAGTTTATGTGAGTGAAG TGGCTCATGG 420 TGGGCCCTAG GTAGTTTTTG CTGAC 445 1661 basepairs nucleic acid double linear cDNA to mRNA NO NO G221.seq 68TCCAGGTCAG CTACACACTG TATGCCTGAG CCCATGTTAA TCTGTTGTAG TGAAGACACA 60TTTGGCAAGC ACAGTGGCTG CAAATGGGGC TACTGTTGAA TTGGTAGTAG CCTGCTGTCA 120TCTTTCAGGA TTTAACTGGG TTTTGCTGGT AAATTAAATG AAGATATGAC GGACTCCCTT 180TTAGGTCTAT AATGGTGGCA ATAATTTCTG CCATTAACCC TGGAATGTGA TATTGTTCTT 240GATTTACTAT TTTGCCTGGA ATGGGAAGGT TTTGAAGGCT TCCACTTGGC TTTCTTCACT 300ATGATCACTC TCACCATGCA GGCCAAGGAC CCAATGTAGG GGCTGTGCTA ATTATCAAGT 360GTGTTGATCT GATTATATCA CTGTGGGTCT GTGGACCCAG AGGACCTGCT ATTAGCAGAA 420CTCAGGACAG GAATCTATCA CCTGGCCCAC ATATACCCCT CATTCTAATG AGAGGATTGT 480GGCGCCACTT CAGGTACCTA GGTAACAATG TCAATTCTAA TCCCGGGTTC AATAGTCTTT 540CACATGTTAG GATATTCTCT TTCCTAGTAT GTAGCTATCC AAGTTAAGTG CCCACAGGTC 600CCCCTGGGAA AGGGCTGGGG AAATCATTAT CATGCACACT TGCTGTGGTG GTGTGGGACT 660CTTCCTTCTG GGAGATCTGG CATCTTCTTC AATCAGTAGG TTCTAAGTCT GAAAACTGGT 720TCAGATCTGG AAACTCGGCA GGTTACTTTT TATTGGGGTG ACTGGACTCA TTATCCATGA 780TTGATTTCTT TTTCTTGCAC AGAATAACCC GCAGCACTGT TGGCTGGCCA TCTATTGTGA 840CCCTCCAGAT GCTATGTTTT ATTAACCATG TCCATAACTC CCCGGGAGCC AGGCCCTCTT 900AACAGCCATT CCAATCTTGC CACCCAGACA ACTGGGCCAA GTGCTATGAC TTGGCCTCTA 960TTATCTCAGA GTCCTATCAC CCCTACTGCT ATCAGTGAGC CCAGTTCTGT GACATTTCCT 1020ATCAATGCTG GCCTATTTTA GCCACCATTT TAGTTCTTGG TGATACTGGT GCCCTTCTTT 1080CCAATACATT TCTTGTTTTG GTATCTTCTT GGCCCTCAGG CGAGTATTCT GAATTTGCTT 1140AGTATATTTA TTCCTGCACA CCCACGTCCT TCATAGTCTG CCATAGCCAT TCTGGCTACC 1200CTCACTTAAT ATGGGCCATC CTCTTTCCAT GCTTCTAGGA GCCATCCAAA CGATGTATTT 1260GCACCATCTC TTGGGGCTTT GCCACAGTTT TAAATACTTT ATCCTGGAAG AATGTTCCCA 1320TATCCATAAA CTCTTTTGTT CAAATTTCTG TTCCACCCCC TTGGTCCAGA ATCCAGTCCT 1380GTGCATACTC TCCAAGCTCC TGCTAGTACT CTCTGGATAT GACCAGAATT TCCTTTGAGG 1440TATAGTCCTT TTCTACACTA GCAAGTTCAG CATGTCTGGC CAGGTTAGGT TTTGACTTAA 1500CCCTAGTTAT TGGCCTAGCA CAAAGAGGTA ACATGGAATG AGAGCATGTC TTGCAGGACA 1560GAAGCCTCCG CATTTTCTTT TTAAAGTAAG AAGCAAGTTC TAGCTTTTAA CAGAGATGAG 1620TAGGTCACTT CTGCAGGTCC AGAGTGTTTA GTGAAATCTG A 1661 1379 base pairsnucleic acid double linear cDNA to mRNA NO NO G238con.seq 69 CTCTGTGCACTGTAGNGTCT TCTTGTGTAC CAGCACTTTT TTCTTCCCAT TTCTTTCTCC 60 ATGTTTACCATCAACACTGT GAGAATAACT GAAGTTTCCT TATCCAAAAA AAGGGTGCTA 120 CTCCAGATTCCGCCACTACT GTTTTCGAAA AGCACAAAAC CAACAGAGCC TCAGGGGGGG 180 CTGACCCTCTTTCTGGATCC CACCTCCATC CCCCCGAACT TACGTTGTGC TTTCCTCAGC 240 AGCCACCGAGGCCTCCAGTT CCGTCTCCAA AGATGATGGG TTCCTTCCAG TGGGTGCAAA 300 GTGAGAGCCCCAGTGATTGG TGTTCATAGT GGGTCAGTGT GAACAACGCC CAATGGCCTG 360 CCTGGGCCAGCTGGGGCCTC GTTTTGCTTT GGTCTGAAAG ACATTTTTGT TTTCTGGTGA 420 GAACAGCCCCAGCCTGGCCA GGAGCCGGCC AGCGGCAGGA ATACAAACCC TTTCATGTGA 480 CAGACCCAAGTGGAGTGATG CCCTCCTGCC AACAGCAGGC CCTCCCCTGC CGCTCCTGGG 540 AGGCGGCCATTTGCATATTT CCCATTCATC CTGGCCTTGA AAAGGAGGCC TGAGTTCCCA 600 GTGCTCCTGCCCGCTGAGGG CTGGGCCGCT CCAGTCTAGT GTTAACTCTG TGGACACTGT 660 AGGCTCTCACAGGCCAACAG CAGAACTTGA CCGCTTGCTG CCGGAGGGAG AGCAGCTTAA 720 GGCTGCAGCTGCTGTGCCGC CTGACCTCCA GAGGGGGGAT TCAGGAGGTG GCAGATTCCC 780 GTTGACCAGCACAGCCTTTT GCTAAACTGG AGGAAATTCA GATTTGTTTC TTCTTGAGGC 840 ATTCAGAAGAGGAATTTTGT CAGACTATGG GGATGCCAGA ATATTCAGCT ATTTACCAAA 900 TTTGCCAGAAAATGTGCCCT TAACCAAGGG CCAAACTCTT TTTGTCTTGC TCACTTTCTA 960 GTCTACAAAAAAATTCAGTG ACTCTGGAAT GGTAGGTGAA GGAGCCATGC CGGATCCTGG 1020 CTGCAGCAGCAATCCCTTTG CCAAGGATGT AGGAGCACAG CTTGCCTGGG GCACTTTTGC 1080 ATCCCCAGGGCTGAGTGCCA TTAGCTTGTG GGCTGTGACT CTGAAGGCAT GAGGCAGATA 1140 TACAGTACCCATCACCATCT TTTTCCTTTC TCCCATAGCT AAGTGCCATC CTGCCAGCCT 1200 CAGCTTCCTGCCCCAGTCCT CAGTGCAGAC AGGCCTCTGC CTCCTTCCCG CCACTGTGTG 1260 AGGGCTCCTGCCAGGGGCCC CAACATCTTA CAGGCTCTTC CTGTGACTTA CCAACCCACT 1320 TCTGTCCCTCTTCGATAGCC CTGTTCTCTA CCCTTTCCCA CCCAGCTCGG ATCCTCTCC 1379 2661 basepairs nucleic acid double linear cDNA to mRNA NO NO G229con.seq 70TCTCTAGACC CTTCCTGCTT CTCTCCCCAC AGCAGCCACC CATTAATTCA TTACATAAAT 60ATGTACTGAG CACCTACAAA GTGTCAGGAA CTGTGCCAGG CTCTGAAAGG AAACCAACCC 120TTGAGGAGCT TACACTTTAG GGGTTAGTCT GGCTGTGTGT CATAAGGGTT ACCTGCCAGC 180TCAGCATGAA CTCCAAGCCC TTTGGTCTGA CCTAAGACCC TCCCTTACTG CCCTGGCCCC 240GACAGCCCCT GCAGATACCT CTAGCCTCAT TGCTCACCAT CCATCTCCTT CATATCAGCC 300CTGGACCCCA GTAACACGAC CCAGACCACA GCACTCCCAC CATCACCTGC TACCCCTGCC 360TGGAATGCAC CCCAGACCAA ATGGTTGACA TCTGTCTCCA CAATGGGGTA AGACTAATGT 420CCAGCGGGAG GTGGAGAGGT CCTGCACTGG GGCCTCCTCT TCCTCAACAC CTACTCTGAG 480GCTTGCCTGC CCACCTCTTA CGCCACAGGG GTGGACTGTT ATCTGTTCCT CAGGGGACTG 540TGAGGGTCTC TGCTCTGAAC TACTGCTTTA TCCCCCAGCT CGGAGGAGGG CCCCTCATGG 600CATCAAGTGC CAGCAGTGAC TATGTTCCAG AGTCTGATGA AAGTGAGCCT CTTTTCACCT 660TTGAATAAAA GAAATGCACA CAGCTTTTAC AGAAGTCCGG ATGGAAAGGC AACATCCAAT 720TTTCCAAAGT TTAGAAAATG TTCTTGGGAC CAAGATCAGC AACAGGCTAT AAGCAGGTAC 780TAAGTACACA GCCAGGGCTG TTGTTTTCAT TATTCTTATC AAAAATAGCA TCTGTGAGGG 840AGCCAAGAGG AGGCCCTTGG GGCCATCCAG GAGCCAGGGG AACTGGGAGC CCAACACCAG 900CACAGCTGCC AGCTCTTTTT CCCACTTAAC GGATTCGGGA ACCATCTCAA AGGAAGCTGC 960AGGAGGGAGG GAAGCCCAGC TCTCTGGGAA TGTGTCACAC TTCCTCCAGT TAGGCCTGGG 1020GCAGCCCCAA GCTCTCCTGA TGGAGGCCCT GGCTCCTATC CAGGCCTCTT CCTCTACCAG 1080ACTGGATAAG GGTGAGGTCA TGTGCTGGGG AAGGGAGGCC AGGGAAGCAG CAACTGGGTT 1140GGAGCCAGTC AGAAACAACA CAATAACAGG ATAACTCATA GTCTCCCCTC TCCCCTTACA 1200CTCCAGGAAG CTGTCCCTGA GTGAACTCCA TACCCCTCAG GTCCCTTCTC CCACTGGGAC 1260CTCTCTGGGG CAGATTCTGT GGGTGCCTCT TAGTCCTCAA CTGAAATGGA AGCTCTCTCT 1320CTTCTCAGGG CTAGGGGCAG CACTGTGAAT CAGACAGACC CTAATGCCTC CTCTCACCAA 1380TCCAGTCCTG GACATGGGCA GCAACCAGTG TTGGAACCCA GGTGGAAATA AGAGGAAGCT 1440GCCAGAGCCT CGAGCCATAC CCTGGGCCAT GGTCACACCA AAGGTTCTTG TGCCTATGGG 1500GCTGAGGGAC AGAGATATGC AGCCTTGGGC TCTGAGATCA AACAAAAATG GGTGTGGGCC 1560TGGGTCCCCA AGTTACAATG AACCCCCCTG TTAGGAAGGT GCATCTGACC TTAGACTCTG 1620TCAGGCTGAA GGACCAGGTC CCCAAGTTAC AATGAACCCC CCTGTTAGGA AGGAGCATCT 1680GACCTTAGAC TCTGTCAGGC TGAAGGACCA GGAGTCACAA GCAGACAGAC AGACACAGCA 1740GGACCATGAC AGGGGCAGAC AAACAGATAG GCATAGCTCA GGCTCCTGGC AGTGATGAGT 1800AAACGGACAG ACACTGATAG ACAGTTAGAC TCAGCGAGAG CCTGGAAAGG ACAGATGGAG 1860AGACAAGAGG GAACGCTGGC AGTGAAAGAC TGACAGACAT AGAGGAGATG GCGGACTTGG 1920CAAGAGCCCC TGGCAGGGAC AGACAGAGAC GCAGTTGCAA GCTGTGGTCA GGTTAAAATG 1980TGGCCATTCT GTCTCTGAGC TCAGCCCCTG ACTGCAGATC CCGATTCTCT TGGAGGTTCC 2040TCCTCTTGGC ACTGTGATCA GAGACTTTGT GGGACTCTTG GGACCCATTT CTCCAGGACT 2100ACAATGCCCT CAACCCCACA AGTCCCAGGA AGGTAGTAGG CTGTGGCCCT CACTGTCCCT 2160GGAGTCAGAC TCAAGAATCA ATCCATTCTC CTGGTTTTTT CCTCCCCTTC CTGGCCTGTG 2220GGGCAGAGAA AGCCTCCTCG ACATCTCTCC TGGGGCCACC TACTCCCAGC ATGGTGGCTG 2280TGCTTGTCGT GGAAAAGGTC CTTTTAGGAA CCACTATGAG TCCAGACTCT GTTGGCACAG 2340GGGGCGGTGC CCAGAAGAGG CTATAGTCCG GCATTTGCAC GACTATCCGA GGATGTTGAG 2400CTCCACCTGG CCGCCTTCTC TTCTCACCAC CCCTCATGAC CTCCAGGCCC CAGAGGCCTG 2460AGGGCCTAAA AGGTTTTGAC CCAGGGGAGC AATTCCAGGC CAGGTGAGGA TGGGGTGATT 2520AGTCCCCTTC ATAGCTGCAG AGACTGAAGC TGACTTGAAC ACACTCTGCT CTGAGGCTGT 2580AGGGTCCAAG AACCCCCCTG GGGTGAGCTG AGGTTTTTCT ACTTTCAGGG GACCGTTGTG 2640CTGAAAGCAT GACGAGGCTG C 2661 863 base pairs nucleic acid double linearcDNA to mRNA NO NO G248.seq 71 CTATGCTAGA GAAAAAGGGA GGTAGTGGTTTCATCCGCCA CTACTACCTA TGGATGTGAA 60 CAGAACCTCT GCTCCTGATG CAGACCCCTGGCCCTTTCCC AGCTCCTATT CTGTTTTGAC 120 TTCTGCACAC CCCTTTTTCT GACCCTGATACTATCCCAGA TCATTATTCT TCCTCTAGTC 180 CTACCCTTGT TCTAGCCAGT GCCCCAGACCCAAGGTGAGC TAAGGGACAG TCTCTCAAAG 240 TCTGGGCAGA GAGCCTCAGG AAGTTGGGGTATGGCTGAGA GAAGAGGGGA GTGCAGGGGG 300 ATAGGCATAC AGACTCTGAA TGCTTGACCTTCCTTATTTT CTGTCTTTGA ACTTATTTCA 360 ACAGAGGAAC CCTTATCATC TAGCCCTGTGGCTCTCTAGT ACCTTGTACC TGCTTCCTGT 420 CCCATAATTG TGAGCGTTTA GCTGTGGTGCAGGTGAGAGA CCCATTCTCC CACCCTCAGG 480 AGCCAGGAAG GCCCACCAGT ATGGCAGGGAGGCCTAGGCA GAGATATACA GGAGAGCAGA 540 GACGTCTGGA GCTAGGTCAC CGGTGGTCAGCAGGGCCTCC TGCAGAGGGA GCAGCCTCCT 600 TTGGCCTTTG CTTGTCTGAC TTCTAATGATCCTGTAAAAA TTAGTTTTGT TTTTTAAGCA 660 CCCCAATGAT GCATGAATAC ACTCTTTTGTCAAATCTTAA AAAGAGAAAA TCCTTTTTTT 720 TTTAAATAAA AAAGAAAGTT ATTTAGTCTTAAGATTGTAA AACTGTAAAG TTAAATAAAG 780 TGGCCGCCCT TTGGCTGCCC TGATCCCCATCCCCTACTCC AGCTTCTGCA AGTAACCACA 840 ATTCTCAGCT AGGTGTATAT CCT 863 1378base pairs nucleic acid double linear cDNA to mRNA NO NO G248a.seq 72GCCGCCCGGG CAGGTTGCTG AGTCTCTGAA GGTAGGAGTG GGAAGTCTCG CATTGGAAAG 60GCCTTCTTAG GTGCAGTAGT ATTTGTTATT TTACACCTTA ACCTCAAAGG AAGTCCTTCT 120TTTTCTTGGG ATGGAGCACT TTAGTTCTCA TAACTCTTCT CTGAAGTCAT TGCAGAGTGG 180GTGGAGGAAG GTGAGGGTGA TGCTTGGGTC TGAATTTTCT TGGTAAACTT ACAAGTGGAT 240CTATCAAAAA CCAGAGGGTT TTTTCTTAAC CACAACACCC CCCAGAATTC CATTTCCTGC 300AGATGTAGCA GCAGCACGTC TAGCCATCTT GGCCCAGGCC TCTGGACCAT GCCTTGGGAG 360GGCTCTGCCC TCTGCCTTGA GTTCCATTAG AACTTCTCCA GTGGAAAGAG TGAGTTACTT 420TGCCCTGGCC TGGTGGGCAG GCTTTTTCCT CTCTGACTTG GCTAAATGAA ATGGGATTTA 480AGGTAGCTCT CCCTGTGGGT AAAAGACATT TTGCTCTATG CTAGAGAAAA AGGGAGGTAG 540TGGTTTCATC TGCCACTACT ACCTATGGAT GTGAACAGAA CCTCTGCTCC TGATGCAGAC 600CCCTGGCCCT TTCCCAGCTC CTATTCTGTT TTGACTTCTG CACACCCCTT TTTCTGACCC 660TGATACTATC CCAGATCATT ATTCTTCCTC TAGTCCTACC CTTGTTCTAG CCAGTGCCCC 720AGACCCAAGG TGAGCTAAGG GACAGTCTCT CAAAGTCTGG GCAGAGAGCC TCAGGAAGTT 780GGGGTATGGC TGAGAGAAGA GGGGAGTGCA GGGGGATAGG CATACAGACT CTGAATGCTT 840GACCTTCCTT ATTTTCTGTC TTTGAACTTA TTTCAACAGA GGAACCCTTA TCATCTAGCC 900CTGTGGCTCT CTAGTACCTT GTACCTGCTT CCTGTCCCAT AATTGTGAGC GTTTAGCTGT 960GGTGCAGGTG AGAGACCCAT TCTCCCACCC TCAGGAGCCA GGAAGGCCCA CCAGTATGGC 1020AGGGAGGCCT AGGCAGAGAT ATACAGGAGA GCAGAGACGT CTGGAGCTAG GTCACCGGTG 1080GTCAGCAGGG CCTCCTGCAG AGGGAGCAGC CTCCTTTGGC CTTTGCTTGT CTGACTTCTA 1140ATGATCCTGT AAAAATTAGT TTTGTTTTTT AAGCACCCCA ATGATGCATG AATACACTCT 1200TTTGTCAAAT CTTAAAAAGA GAAAATCCTT TTTTTTTTAA ATAAAAAAGA AAGTTATTTA 1260GTCTTAAGAT TGTAAAACTG TAAAGTTAAA TAAAGTGGCC GCCCTTTGGC TGCCCTGATC 1320CCCATCCCCT ACTCCAGCTT CTGCAAGTAA CCACAATTCT CAGCTAGGTG TATATCCT 1378 797base pairs nucleic acid double linear cDNA to mRNA NO NO G248b.seq 73GTGTATATCC TTCCAGACGT CTTTCTATAC ATTTACTTTT CCTTATTGTT TAAACCAATG 60GTGAGTTGTC TTTTCTCTTA CTTAAATCTG AAAGTGTTCC TAACCAATTT AATAACAATT 120GCCTCAGTGC TGTTTATTGA AAGGTTCTTC GTTTCATACT GACATAAAAC GCCAGTTGTG 180TTAGACCCTG GCCAGGCCTG CTTCCTCAAA GACCCAGAGT AAACATGAAC TGTAAACTCC 240AAAACTGTAC AACTAGTTTT TAAAGAAAGA TTGCCCAAGA TACTGGCACA AGACTTTTTA 300AGGCCTAGGA TTTGCATATT AGACCTATGT AATGTGGCTT ACTGAAGAGC AGAGTTCTTG 360CTTTCTTTGG TAGTGTAAGC TCTTTCTGGT GCTCACACAG GAAGGACTGT AAAGGGCAGT 420GAGGGCTCGA ATCTGGACTC TTCTGACATG AGGGACATCT CATTTTATGC AGGCTGCCAA 480GACCATTGAA CTTGGAGGAT GCCTTTGTGA GAAAGCAAGA AAGGCAGTGG GGAGCTGCAG 540CCCCCACATG CACCTTCATC TCAGGAACAT CCTTTGTACT TTTTTTTTTA ATATTGTACA 600GAGCTGTTTT TTTTTATTAT ACTTTAAGTT TTAGGGTACA TGTGCACAAC ATGCAGGTTA 660GTTACATATG TATACATGTG CCATGTTGGT GTGCTGCACC CATTAACTCG TCATTTAACA 720TTAGGTATAT CTCCTAATCC TGCCCGGGCG GCCGCTCGAG CCCTATAGTG AGTCGTATTA 780GGATGGAAGC CGAATTC 797 403 base pairs nucleic acid double linear cDNA tomRNA NO NO G248c.seq 74 CATTGATGGA ACCAATACAG AAAAAGGATT TTCATCATCCAGGCCTTCTT CTACAGCTGA 60 AAGACTGGCA GCTGGTATAC AACTGTTCCC TGCAAGGATTGGGAGTTAGC AGCTTTATGG 120 ATAAGGGCAA TGCTAGTGCT TGCTTCTGTT CCTTACTAATAAATATCGTT TGTGACACTT 180 TTTTTCAGAA TAGGGCATTT TTGTCTGTAT TAAAAACCTGTTGAGGCAGG TATCCTTTGT 240 CCTCAATTAT TTTCTTAATG ATACCTGGGA ACCTATCTCCTGCCTTTGGT CAGCAGAAAC 300 TGCTTCTCCT ATTACCTGGA TATTTTTAAG GCCAAACCTCTTGCTAAAAT TATCAAACCA 360 TCCTTTGGTG GCATTAATTT TCAAGTTTAG CTCCTTCAACCTC 403 1083 base pairs nucleic acid double linear cDNA to mRNA NO NOG220a.seq 75 GGAAGACACT GATCATCTGT CACAAACTTG GTGAGTCATA AATAGTGCCACCTAAACCAT 60 GAGATAAACT GGGGGTGCAC CTGGAAACCA GGTAGCCCCC CTCAAGGGCAGGGGCTTTTG 120 TATTATAGGT CTGCTGCTAT TCTCCCAGTA CCCTCAATGG CACATGTCATGTAGAAGAGT 180 CTCAGTAAAT ACCTGGTGGG TGACAGAATG GAGGTGGGTG ATTCTGTTGATGAGCAGGCT 240 GGCACCGATG AGCATGGAGT ATCTGCCAGC CCTAGGATGG TGCTGTGTCTGTGTCCATCC 300 ACTGTATGGT TACAAGAACT CAAGGTACCT GGGATCCCTC AGTCCTCACAGACCAGCTCC 360 CAAGCTGGGC ACAAAAGACA ATGTATGTTG AGTGTTGTTT CTGACATGAGACTACTGGGA 420 CAGTAGGTGT CTGCCTGCTC CAGTATGAAG GATCCCACTA CTTTGTCACTGGATGGCTTT 480 GGGTTGCAGT GGTTTTCTTA CCAAAGCACA ATGACCCTTC AGTGGGGTCAGCTTCAGCAA 540 GATAAAGGCC TGGCCTGAAA CAGGTGTCTT CTATAAGAAA GACAGAGTTGTGTCCATTAT 600 GCCTCTCTCG CTGCTTCCTG GTAAAGGGAC CTAGGCATCC CTGGGTGACTGGAGTGCCTG 660 GTGACCACTT CCATCCACCC CCATTATCTG CTGCTAAGGT ACACATGAAGTATCTTAGTT 720 CCCAGAAAGA GAACCCCTGT TGAACAGTAA CAAGCCCCAG CATAGGGTGCTAATGATTTA 780 TCTGCTTTCA CATTTGAGCG TGCTTTCTTG GAAGTGATGG AGAATCTTCGGCCTGAAGAT 840 GTGGAGGCGC ATGCAGAGTC CTGAGCTCCC CACAGGCAGC TTAGGTGTAACTGAGAAGGA 900 GCTGTGAGCA TATCTGGCTC TCCAGCTCCC ACAGCAAGCG GGGTCCACCAGTATTGACAT 960 GGCTCTTGTC TGCATGATAA GCTGGACCAA CAAGGCCAGG GCTCTGCCCACAAAGCTAAA 1020 CTAGTATGGG GACTTGGCAC TTGTCCCTGT CAGGGGAATG GTAAGCATTTTCGCACAGAC 1080 TAG 1083 854 base pairs nucleic acid double linear cDNAto mRNA NO NO G255.seq 76 CTCAGTTTGG GCTCTGGGCT AATCATCTGT CCTGTCCTACTTGTCTTCCA CAAGGGACCG 60 ACTGGTCATG AAGGCCATGA AGGCTGTCTG CTGTGTGTCCCTGGAAATGT CTGTCGACAG 120 CCTCTCTAGG CAGAAGTGTT TCTTCTTGTG TACCAACCAAGACCTATAGG CCTCGCCTCA 180 TCTCCCAAGC TATACCTTAC CACAAAGCAG AACAAGGGTGGACTAAGAAC TGGCAGAGAT 240 TTATACTTGG CTTTCCAGAG GTCCCAGGTT TGTGGTAGGGGTTCATGAGG CTGGCTGCTA 300 TCTAGATGAG ATATGCAGAG TGAGCTCCTT TCCCTGAATGCTGGGCATCC CATCGGTAGT 360 ATGGGACAGG GTAAGCTCCT GGCCTGGCTG GCTCCAATGCTGCCTGAGTG AAGCTATGTA 420 ACCCTGGGAC ATCTCTCTTA GCATGCTGAT ATTTGGCTGCTTCTCTGATA ATGGGAGCAG 480 CATTCTCTGG TACGGGGTGC TGTGGAAGAC CTAGGGAATGGGACAACAGA TTAAAATGGG 540 CTTTGAAGAC CCTTGGAGAG GTGACCAGGG AGGCCCAACCTTCTATTTCC TGTGCTCAGG 600 CCTTGGGAGA GACAGAAACC ACGAGGGTCC AAGGTCCCCAACCAGTGGGA CCCCGACACC 660 AGGAGGACAG GACTCTCAGA GTTCTGTGCC TACTCCTCAGTTTCTTTTGT GTCTGCTGCT 720 ATCAGGAGTC CCAATCTACA GGGCTCAATC AGGATGGGATCCTTAGTGTG GCACCTGGGT 780 CAGAAAACCG CCCCTGCTAA GAGGCTCAGG ACAGGGCTAACTGGGAGAAG AGGCCCCACC 840 TAAGTGTCTG CCAC 854 605 base pairs nucleicacid double linear cDNA to mRNA NO NO G306.seq 77 CCCGCACTGG AACAGAAGCTTCGTGAGAAC AGGGACTTTG TCTTGTGTGT TCGCTCCTCT 60 CTTCCCAGGT CCTAGAACAGTGTCTCGTCC ATAACAAACA CTCAATGAAT ATTTAGGGAA 120 TGAATGTCTG CAAGATGCTGAGAATCTCTC ATAGAGTTTT CATATGTGAC CCCTCTTTGA 180 AATTGGGTAT TATCAAGTTATTCATTTTAA TGATTCAACC TAATTCAGTA ATCAAGCAAA 240 TTGGCAGAGA CCTAAAATATTTACCCGTTG TTGTGAGGAT GAAATAAGTA AACAAGTGTA 300 AGTTATTTAG AGCAGTGTCTGGTAACCACA GCCCTGTGTA AGAGTTTGCT GCTGTTGTTA 360 AGAAATGCTT GACTTCTTGATATCTTAAAG TTTTTGCTGA CTCTGCTGCC TGTGTTGGGA 420 TCCCAAGCAG AAACTGTTTGTGGCCCAGCA GGTGTTGGCA CTGGGTGAGT GCTTCTGGCT 480 CTTGTCCCAC GACGGACATCCAGGTCTTCC AGCGGCCTGA GGATATAGGA GGGGCTTCAG 540 GCGGATGATT GTGGCCGTTGCTTATGTTTT TTCCTTGTTT GGCCTACAGG CACATGTCAC 600 CACAC 605 890 base pairsnucleic acid double linear cDNA to mRNA NO NO G256.seq 78 GGGGCCAGTCATCTTGAAGA AGTCTTCCAC ATGCCCCTGT CACACTCATC CCTTTACCAA 60 AAGCCCCTACCCATGGGGTG GGTCAGGCAG GCCCCAAGAC AGGCCCGTAT CAGGAGGACC 120 CCTCTTCTCTCAGGGGCTGC CCTCTGGGAT AACCACCCCC GCCCTTCTGG GTTTCCTGCT 180 TCCTATCTGGCTGCAGTTTC TCAGGTCCCT TGTGGATTTC CCCATGGTCT GTCCCCACTC 240 ACATCCCCTCTCTGCAAACC TTGCCTACTG GGCCTGCACC TGGCAAATCC ATGCTCAGCA 300 CAGACGGGGATCAAGACCTC TCAATACAAC TGTCTCCTGC CAATCCCTGC CCCAGCAGCC 360 TGAGGCCCAGTCTGAAACCA GGGAGTTGCT CTCCTTTCTC CTCCCTTGAC CTCACCCCTC 420 AGACCATGCCAATTCTGCCT CCTAAACCTC CCAGGCCAGC CCCTCCCCCA GCTCCCAGTG 480 ACAGTGTCCTCAGGTACCTG AGCTCAGCTC TCGGTGCTAC CAGAGGGACT GCCAGGGGCT 540 GCAGCCGGGCCTCCTGCAGA GGCTGAGTCC CACACGCAGG GAACAGCCAT GCCACTGCTA 600 GCAGACCAGTAAGAGAATGG CCACCTGGGG CCTGAGCGCC CTCGGCCATC CACCAGAAAC 660 AAAGTGTCAAGGAGAAGCTG CCCGAAGCCC ATGGGACAAA CCACTGGGGA CTGGAACACC 720 AGTAATTCTGTATTGGGAAG CGGCACCAAG AGATGTGCTT CTCAGAGCCT GAGGCTGAAC 780 GTGGATGTTTAGCAGCGTGA CCGGCTACCA GACAAACTCT CATCTGTTCC AGTGGCCTCC 840 TGGCCACCCACCAGGACCAA GCAGGGCGGG CAGCAGAGGG CCAGGGTAGT 890 1370 base pairs nucleicacid double linear cDNA to mRNA NO NO G181.seq 79 GCAGAGGAGC CATGTCCTGCTGCTTCTGCA AAAAACTCAG AGTGGGGTGG GGAGCATGCT 60 CATTTGTATC TCGAGTTTTAAACTGGTTCC TAGGGATGTG TGAGAATAAA CTAGACTCTG 120 AACAACTGCT TTGTTACCAGTGTCTCAATT TGACTTGGGA CTTAGTGACC ATTTTAAGGG 180 AGACTGGTGC GCCACAAATCCTGGGTGGCT TGATCCTGCC ACGTGGATGC TGTCTGGGTG 240 AGCTTGTTCT CACACTGCCCTCCTGCCACC CCCATTTCCA GAAAGGTGAT GATAACCCTA 300 GCAATCTTGG AAAATCCACAGGAACTGCTA CCAGGTACCA GGAGCCGTTC TGAGCATTTT 360 ACCTATGCTA TCTAACTTATTCCTCACCCC AACCAAGAGT ATGTTTTCTC CGTTTCATGG 420 GAAACTGAAG TTCGGCCTGGTTGAGCAACT GTCTAAGCTG ATAGTGGCCC AGCTGGGGCT 480 TGAATTCAGG TCCCTGTGGTCTGGAGCATG CTAATCCTGT GGCATGTCTC CCCCTAGTGG 540 TCCTTCCAGA AACTGCAGCCGCCGCCCCTG CTCCTCCCAG GGCCAACATC AGGGATCAAC 600 ATCCCCTGAC CCCCTCAAGGCAGCAGGTTC TGCTGACACA AGCCACCCAA TTCTTCATTC 660 CATTCCTTTA AAACCCTCCAAGCCTGGAGT CTCCACCCCT GCCTAAGCCC CCAGCCTCTC 720 CTGCCTGATG ATTTAGCAGCCACCCTGTAG GCCTCCCGGC CAGCCCTGGA ACCCACACCC 780 TGACGATCTG TGCTCTACTGGGGAGCCAGA TGGAGTTTTA GAAAATGCAA ATCTGACCAT 840 GTGGATCTAT ACTGAATCCCCCAGTCCTCG GGGTCTTCTG GACCTTGTCC ATATCCTTAG 900 GACAACGTAT AAGGCTCACCTCCATCTGTT GCTTCTGTTT CCCCCATGGC TACCACCCTA 960 ATCAATGCTC CAGCCAACAGGAGTGCTGGG ACTTCCTAGA CAGCCTTCCA TGAAGCCTCT 1020 CTTCATGCCC TGGAGCTTCTATCCACATTG TCACCTCAGT CTGGCATGCC CTTACTTGGC 1080 TTGATGAGTT CCTATTCCATGGACCAACTC AAACTCTGCC CACCTCTGGA CTGTCCACAC 1140 CATCCCAGGA TGGGGCCCTCCTCCGAAATA GGTGGGTACA CAGGGACCCA CTGGAGGGAC 1200 AGCCACTGTG GCACGGAGGTGGTGCAGACC AGCCTGGAGG CAGAAGGCAG GAGGCTGGGA 1260 CATCCCGAGT GTGGCTTCAGTCTACCACCT GGCCCTTTAG CCCTGAGTGC CCCCCTCTAA 1320 CTCCCCTGCA CACCACCCTGTGGCCCTACT CAGTCTGCCA GTGGAAGGAG 1370 695 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G257.seq 80 GGACATGCAG CGAGCTGTGC CTGCCCAAACAGGGCCTCAG GGAAAGTCTG AGGGACCCGT 60 GAGGGATCCA GAAGAGTCTT GGAGGAGGCTCATTCCAGAA CCACTCGTCC TGCTGAGAGC 120 AGAAAGCCCA CATCTGCCAC CTCAATTCTGACCCATCAGT TCCAGGGGGA TGCAGGTGCG 180 CGAGCCGGGC AAGGGCCTGG GACTTCCACCTGGCATCTTG ACCCAGACTC TAGCTCCAGA 240 CATAGAGGGC AGGAACGGAT GCCTGCAGGACTTCAGAAAT TAAACAGGCT TCTGGTCTCA 300 TGATTTCTCC TGCTTTTGAT TTTTAATGCACCTCCCGATG GCTCTTCCCA AGAGGGCACA 360 CATAGGCTGT GGCCCCTCTG GGTGCCTGATGATCCTCCCA GCCAGAGATG AGGCTCAGAG 420 CAGAGACTCA GGAGCAGGGG ATGCATTTCTGGCCCTAGAG GGAGTACACC AGGCGAGTAG 480 TAGACACAGG TCAGGGAGGG CACTGTGGTGGGAAGGCCTG GCACACCCAT TGGGCGTTTG 540 TGTCCACAAG GACCCTCTGC CTGAGTGATGTGCATGGTGG AGTTGCCAGA TCCTGAGGGA 600 AAAAAGGAAG CCCCAAGAAC AAAGAAGCAAACAAGGAGGT CTCATTGTCC TTGGCCATCC 660 TCAAAAGTTG ACACCCCGCC ACTACTTTCTGCCTG 695 700 base pairs nucleic acid double linear cDNA to mRNA NO NOE2.seq 81 CAAACCCACT CCACCTTACT ACCAGACAAC CTTAGCCAAA CCATTTACCCAAATAAAGTA 60 TAGGCGATAG AAATTGAAAC CTGGCGCAAT AGATATAGTA CCGCAAGGGAAAGATGAAAA 120 ATTATAACCA CGCATAATAT AGCAAGGACT AACCCCTATA CCTTCTGCATAATGAATTAA 180 CTAGAAATAA CTTTGCAAGG AGAACCAAAG CTAAGACCCC CGAAACCAGACGAGCTACCT 240 AAGGAACAGC TAAAAGAGCA CACCCGTCTA TGTAGCAAAA TAGTGGGAAGATTTATAGGT 300 AGAGGCGACA AACCTACCGA GCCTGGTGAT AGCTGGTTGG CCAAGATTAGGAATCTTAGT 360 TCAACTTTAA AATTTGCCCA CAGGAACCCT CTAAATCCCC TTGGTAAATTTAACTGTTAG 420 TCCAAAGAGG AACAGCTCTT TGGACACTAG GGAAAAACCT TGTAGAGAGAGTAAAAAATT 480 TAACACCCAT AGTAGGCCTA AAAGCAGCCA CCAATTAAGA AAGCGTTCAAGCTCNACACC 540 CGCTACCTAA AAAATCCCCA CATATNTGTG GACTCCTCAC ACCCTANTGGGCCAATCTAT 600 CACCCTATAG AAGAACTAAT GTTAGTATAG GTAACATGAA AACATTCTCCTCCGCATAAG 660 CCTGCGTCAG ATTAAAACAC TGAACTGACA ATTAACAGCC 700 254 basepairs nucleic acid double linear cDNA to mRNA NO NO E9f.seq 82CCCTGAGTTA AGGATCAGTT GGNTGTGGTG TTAGCTAAGA AGGCTGCCAC CCATCATTCA 60CATAATGAAG TGACTCAGNG GACTGTGCTG ATGGTTCTGT CCCAGGCACA GAAGACTAGG 120AGGCTATGGA GGGAGGACAG ACTGAATTAT GTTTCANGTG CAATGGGGGA GGAGAGGCAG 180GCAGCAAGTT CCTGGGCCCA AAGTGGCACG GGTGCAGAGT GGGAAGGTGG CAAACCCCTC 240TGTGCTGGTG ATAG 254 391 base pairs nucleic acid double linear cDNA tomRNA NO NO E9r.seq 83 TGCANCTGTG CCTCCTGTCG GTGTATTTGC ATNTGGTGCTGCCTATGTAG GGTTGCTATT 60 CCCTCCTCCT CACTCTGTCC GGAGAACCCC CATCCATCCTTCGAAGTCCA GCTTGGTAGN 120 TGATCGATAA CACACATGCC CAGAGAGGAG CTTCTTCTGTCCCTGGAATA CAGNCTCTCC 180 TNGTACCATA TCTGTCGCCC AAGTGCAAGT GGNTCTGATTGGATGTGTCC CAGCTTCTCT 240 TGCAGGTCTT AGGTGGGCTT GGTCCTTGAA AGCACTGGCCAATCAGAACT TGCCACTCGA 300 AAACAGTCGA GAGCTGCCTG TGGGGTTGGA GTTCGGATGCTNGATTTCTG GTTCTCACAG 360 ATGTNAANAA TCCTTAGACC TGCTGTTCCA A 391 302base pairs nucleic acid double linear cDNA to mRNA NO NO G123con.seq 84CAAGCATGAT ACCAGGGCCA TGGCAGGTGA GCTATCCCCA GGTTCAGTGG AGAGAAGCTA 60CTCCTGGCCT TTCTCCCACC AGTCAGAGAA GCAGCTGCAT TATATGGCTA AAGGGCTGAC 120CTCAGCTTTG TTCCAGCCCT TGGGGCAGCT TGACCTTAGG CCATCACTTG GCCCACACTC 180TATACTCTTG GGGAGCCTTG AACTGCCAGG CCACTGGCTG GCATGGTTCT GAACCTGAAC 240CTCTGAGATG TCTGGGTGCT CTCAGGGTAA GAGGCAAAGA GAGGCAGCCT ACCACCTCCC 300 AC302 2995 base pairs nucleic acid double linear cDNA to mRNA NO NOA116con.seq 85 CCGCAAATGT CTGTAAACTT GGCCCAACAT GAGAAATCTA GCTGGTGTGGCTGAGCACCA 60 CTTTTGTCAG TTTACCCCAG TGGAAAGTGG TCCTTGATGA CAGTTATCTCATCACAACCC 120 CAGCAAAACA GATCTTTCAA CAACCTCCTA GCTCCCTCAA CCTAGAACTTCCACCATTAG 180 TGCTTATACT GAAACTCTTG AGAATGACCC ACAGAAGCAC TGTCAGGTCCTTAGAATGAT 240 CTAATTATTA AGCACAAACT TGGTTCAGAT CTTACTAATA CTGCTCCGTTTCAGGCACTG 300 TTTGGCTAAT GACTCGGTAG TGTCAGCTGC CATCACCTTT AGGTGAGCTGTCAGACCTTA 360 TTTAGTTTCT TTTCCTAGGA AAAGGCTGAA GTCTTGGTAG GATGTGATGGGACAGCTCTC 420 CATCCTGAAA GGTCTGGCTC TATCCCACAC TACTCCAGGC TTGGTCCCTAGGACCCAGCT 480 CCATTGTGAT GGGGGCCAGG GGAGAGTTTT GAGGTCCCTC CCTGGAGATTCCAAGCAATT 540 GCTACCAATC ATTTAGGACT CCTGGGATCA GAGGGGATGG GGAAATTGAGGACTCCATCT 600 TGGTTGTGGA GATTGGAAGC TGATGGACAT AATTCCTCTC TCTTTTTGGTAATTTACAAG 660 CAATTAACTT TGCTTTTAGA ACTTGAGAGA TTTCCACAGC TGCCTAAGACTTCACATACT 720 CACTGCCTAC CCTCCATCAG GGATTAGATT GAAGGGCAGG AGAAAAAGAAGTCAGAGCTG 780 CTGCTTGTTC TGGGTGGTAT CACTTTCCTC CCGTCAGTCC ACTCTACCTTGCTCTGCACC 840 ACCTGCCATC ACCCACTAGG AGAACCCAGA TGGAAGCCAC AGGTGGCTAGCCACCTCAAT 900 CCAAATCTTA AAAAGGGCTG GTCCCTGAAG GAGCTGGAAA AGTCCATCCTGTCTATTCCT 960 CTGTTCCCAG GTGAAGCTAT TCTTGAGAAA TCCAAGTGGA GCTTCCAAGTAGAACTTCTT 1020 TTTTTTTTTT TTTTTGAGAC AGAGTCTCGC TCTGTTGCCC AGGCTAGAGTGCAGTGGCGT 1080 GATCTCAGCT CACTGCAACC TCCACCTCCT GGGTTCAAGT GATTGGCCTGCCTCAGCCTC 1140 CCTAGTAGCT GGAATTACAG GCATGCACCA CCACGCCTGG TTAGTTTTTTTTTAGTAGAG 1200 ACAGGGTTTC ACCATGTTGC CCAGCCTGGT TTTGAACTCC TGAACTCAGGCAATCCACCT 1260 GCCTCGGCCT CCCAAAGTGC CAAGATTACA AGTGTGAGCC ACCGCGCTCGGCCCAAGGGG 1320 AGCTTCTGAC AAGCAGGGCC TGGGATAGGG GCCTGTCCAG GCATCCACATATAGAATATT 1380 TACCCAGCAG GAGTCCCCCT GCCACTCACA CAGCATCTCC AAGATCAGGGACCAGTACTT 1440 CCTGAGCTTG ACAGAGAATG AATGTGTCAG ACTGACCTCT GCCCATTTTGTAGTTTTCTC 1500 ATCATTTTCT CACTCAGTCT TCCCTTTTCA AGGGCCCACA CTCTTCCCGAGGGCTGGGCC 1560 TAGTGAGCGG GGTCACAGTA CATATGGTTT CTGGGACTGA GAAGGTGGAAGATGTGTCCA 1620 TAGAGCTTTT GTTTCCTAAG CAACGTATTA CTGCCATGAT TCCATTCCCTAGATGATGCT 1680 GGTGATGCAA GCTGGCTTCT CTTGGCCAGC CTACCCTACT GCTGGGTAGTGTTTATGCCC 1740 CATGGCCAGA CACTGAAGAG GGAGACAGGA AAAGCACATA TCCACACCTTCCACCCTCAG 1800 ACATTCCTGT AACTTGAGCT TATCTAAGGG GGCATTGTCA TATGTCAGGGGTTCCCAAAC 1860 TACGGTCTTC AGAAACACTG TTTACCCTCC ATAGAGGTTG TGTGCATCAGCCCAGGCAGA 1920 ATCCTGCTTC ATGAAGGTGT TTTCCTAATG CATGTGTGCA TGGACCTGTCTCATGCTACA 1980 CTGCAGGGCT GGTATTCAGC ACCAATAGTT ATTGTTGGCT GCTAAAATAGCAAACTAGCC 2040 AAAATGGCAG GTAAATAACC CCAAGCCCCT ATCGCCAGTG TCCTCCCACTACTCCAAACC 2100 CCTCTCCCTC AGACCTGCCC CCAGTCCAGT ATCTACCTGC ACTGTTCAATATGGTAACCA 2160 CTGACCACAT GTGACTATTT ACATACAGTT TATTAAATGC AATTAAAAGTTCAATTCCTT 2220 ATTGCACTGG CCACATCTCA AGTGCTTAGC TGGCACATGT GGCTAGTGCCAGTGCCTACT 2280 GTATTGAGCG GTACAGACAG ACATTTCATC ACTCTAGAAA CTGGATGGCAAGTGCTACTC 2340 AGCACAGCAG CCGTGAGGAC CTTTCTTGGG CTGCTGACTG TTCTGTCTGTGACTGTGTCA 2400 TGTCAACTGA CTTTTTGGAG CAGCATCTGT GTGTTAGCAG GACACATCACCTATGGCACA 2460 TGCCTCAAAA CTTAACACTC CTTGGGCCCC AGGAGCCCAG AATCAACTGACAGCCCTGGT 2520 GATTGTCAAG GACAGGTGAC TATGTTTATA TAAGCATGTT CCTATGACAGGAATGTCCCC 2580 TCCTTCTGCC ATTGTCTATG TGAGCATAAA CAAAAGGATT TTTTTTTTTTGAGACAAAGT 2640 CTCGCTCTTG TCACCCAGGC TGGCGTGCAG TGGCACAGTC TCAGCTCACTGCAACCTTCA 2700 TCTCCCGGGT TCAAGTGATT CTTGTGCTTC AGCCTCCACA GTAGCCGGGATTACAGGCGC 2760 CCGCCACCAG GCCCGGCTAA TTTTTTTTTT TGAGACGGAG TCTCGCACTGTCGCCCAGGC 2820 TGGAGTGCAG CGGTGCAATC TCGGCTCGCT GCAGCTCTGC CTCCGGNGTTCATGCCATCT 2880 CCTGCCTCAG CCTCCCGAGT AGCTTGGGAC TACAGGCACC CGCCACTAGGCCCGGATTAT 2940 TTTTTTATTT AGGAGGAACG GGTTCACGGT AGCCAGGATG CTTGATCTCGACCGG 2995 1870 base pairs nucleic acid double linear cDNA to mRNA NO NOA25con.seq 86 GTCATCCTTC AACAAACACT TAAAAAATGT TTGAAAACCC CATCAATTCAGTCAGACTCT 60 TTGGGTGGGA GCAAGATCCA GGCATCAGTA TTTTTTAATA TCCCAGATGATGGTAATATG 120 CAGCCAGGAT TTAAAGTCAC TGGTTTAATA TCTTGGGAAA AGCAGATCCACTCAAGACCT 180 CACAGGGTCC TGACAAAGGC CACTTTCAGC TCAGTGGAGT GGGACACTGGGGTGGGAAGA 240 TGTCCATTTT TTGGATGTGG GTCAGTCTCT TGCACAGGCA GAGGTATTGCAGCATGCTGT 300 TGTAATGTGT ATCTTCCTTG GCAGTGTCTG TTGAAAGCTG GTTGCATCAGTTTGTAATGG 360 GGTGTTATGG CAACAAGGTG GGCCCAGCCC CCCCCAGGAA GTGGATCACTGAGCACAGCT 420 TCTACAGGGC CATTTGTAGA GAGGTGGCAG ATGGGCTTCC CAGGGGCTGCCACCCAGGGC 480 AGAGCCAGTG CTGAGGCTCT GACAACCTCG GCAGGGTGGG GGAGAAGGCCAGACTCAGGG 540 TGTTTATGTT TGTGGGTAAT GACAGTCAGC TCTGGGCTCC AGATGATGCCTACTCCCTGG 600 CCTCTGTGTT CAGATTAGGA ACTTGCAACA TCTTGCTGAG GACCATGTCAGGCTCAGCTC 660 TAAGTGCTGT GGCTGAGAAT TTTCCTTCCT CTCTGTGTGG TTAGTGGCAGCCTCCCTAGC 720 AATGGCTGAC CTCTAGCATA CTCTGTCAAA CTACAGGCAG CTGGGACAAGACAGGACATG 780 GGGCTCACAG ACAGGTATTC CACAACCTGG GCCCTGTCAA CCCTCCCAGAAATGCATGGG 840 CCATGAACCT CCTGCTGTGG GAGGGGCAGT GCAGAGAAGT CTCAATAAGCTTCTCTTGGC 900 CCTCTGGGAT CTCCACCATC CACAGTGTGT AGGGCTGAGC TGCAGGCTGGGTCTTCAGGT 960 GGTGTCCCTG CACATCTGCT TTGCAGCGTG GCGTCTATAG AGCAAGAGTGAACGGGAAGG 1020 GGCCTCGGGC CTCCTGTAGC TCTGCTGGGC AGGGACGCTG CGGGGCCTCAGCTGGGCTTC 1080 CTTGGCTAAA GGGCACAGAG TGGCGTAGGC TGCAAGAGGA CAAGCTAAGCTGATGAAGGC 1140 TCTATCACTC AAGGGTAGCC ATGTAAAAAA AAATCCCTAC AGGTAAAAGAAGCATGAATG 1200 AGACAGGCGG GGCATAACAA TGTCTCCCCA CTGAAGCTGC AACTCTCTGCTTCACTGGCT 1260 TCAGCCTCCT CTCTGTGAAA TGGGGGCAAT GTCCCCTAGG CCTCTTCCTCCCTGTCCAGT 1320 TAGAGCTGAG GGTCTACAGG CCAGAGGGAG GCCTGGCTCT CAGGGCCTTGTTCTCTGTNT 1380 TNGCCTCTNC GCTGGCNACC CCAGCCCCAN TTTCCACGTC AACCTCCCTTGTTTTTTTAT 1440 TATACCNCAA CAGCAGCTCT TGGCAGCCCA GTTGGACTAC CCCCTTCCTGTTGNCTTCCT 1500 TAGCAAAGCA TTTTATGGAA TGCTTCCTTT TCATGCTTCA GGAAACCGGTGGCCGGGAGG 1560 AGTTCTTGAT TTCATTTTCT TCCCTAGAGA TATGTGTGCT TCGGAATACACAAATTAAAC 1620 AAAAGCGAGG GCTGACTGGG ACCAGGAGAG TGAGTGATCC TGGCTTCCCTTGATTTACAT 1680 GCTTATTTTC CTTCTCAAAT CACTCCAGTA AGTACAGAAG TCACTAATCTATTGCCTTCT 1740 ATTATCTGCA TTATAGTTAA AAACATCGAC ATGAACAAAC AAAAGCCCTTGCGTAGCCTA 1800 GAGAAGTCAC AAAGCTCACA CCCAGACTCT CGCCTAAGAG AGTCTCTCAGGGCTCACTCA 1860 GGGACTATTT 1870 806 base pairs nucleic acid doublelinear cDNA to mRNA NO NO A46.seq 87 CTACTACTAC TAACTCGAGA ATTCTGGATCCTCCAAACAC ACTCCACCTT ACTACCAGAC 60 AACCTTAGCC AAACCATTTA CCCAAATAAAGTATAGGCGA TAGAAATTGA AACCTGGCGC 120 AATAGATATA GTACCGCAAG GGAAAGATGAAAAATTATAA CCAAGCATAA TATAGCGAGG 180 ACTAACCCCT ATACCTTCTG CATAATGAATTAACTAGAAA TAACTTTGCA AGGAGAGCCA 240 AAGCTAAGAC CCCCGAAACC AGACGAGCTACCTAAGAACA GCTAAAAGAG CACACCCGTC 300 TATGTAGCAA AATAGTGGGA AGATTTATTGGTAGAGGCGA CAAACCTACC GAGCCTGGTG 360 ATAGCTGGTT GTCCAAGATA GAATCTTAGGTCACTTTAAT TTGCCACAGA ACCCTCTAAA 420 TCCCCTTGTA AATTTTCTGT TAGCCCAAAGAGGAACAGCT CTTTGGACAC TAGGNNNNNA 480 CCTTGTAGAG AGAGTGAGAG AATTTAACACCCATAGTAGC CCTAAAAGCA GCCACCAATT 540 AAGAAAGCGT TCAAGCTCAA CACCCACTACCTAAAAAATC CCAAACATAT AGCTGAACTC 600 CTCACACCCA ATTGGGCCAA TCTATCACCCTATAGAAGAG CTAATGTTAG TATAAGTAAC 660 ATGAAAACAT TCTCCTCCGC ATAAGCCTGCGTCAGATTAA AACACTGAAC TGACNATTAA 720 CAGCCCAATA TCTACAATCA ACCAACAAGTCATTATTACC CTCACTCTCA ACGAGGATCC 780 AGAATTCTCG AGTTAGTAGT AGTAGT 806639 base pairs nucleic acid double linear cDNA to mRNA NO NO A66.seq 88CTGAAGACTG TTCCCAGTGG TCTTAGATAC AGAGTGGTGG CCCTTGGTCC TGTCAGAGTA 60GGTTTAAAGA CCACACAGGT AGATTTCTCC CAGAAACAAC ACCACTTAGA ATTTTCCTTC 120AGGGAGCATA GCACAGGGGA GATGTCCACA CACAGTCATA CCTGTGTTTG GAATCCCAGC 180TCTGCCTTTT TGCTTGTGGG TGGTCGAGTT GGGAGTGTGC TTGAGAAATT ATTCAGCCTC 240TTCAACTCTC AGTTTCTACC TCTCCTTTCC AGGCTGAGCT GAACATCACA GAGGGGAATA 300TCTGTGATTT TCTTGAGAAA CTTCACAGCG AAAGCTGCTG GCTCTGCCCT TGGTAGCCAT 360TTTTATGGTC TGGAGGGACA GTGGCTTCTT CCTAGAGCCA CTTTGCAGTG TTCCCTTGAG 420GCCAGCTGTC CATCCTCGAG AGCAGTTAGG AGGTCCATGT TGAGAGTGTG CTCAGTCCTT 480AGTTGGAAAC CTGGAAACGC AGGCCATGAG GGTGGTGTCC CACTGGCATA TGGCAGGTGG 540GGCCTTCTGC CACCCTGGCT GTGTGTGTGG CGTCCAGTGC GAGTGGTAGC CAGACATCAT 600GCCCACCTGC CCTCGAGCTG CTTGCCTGCA GCTGGCTCC 639 509 base pairs nucleicacid double linear cDNA to mRNA NO NO A42.seq 89 CTACTACTAC TAACTCGAGAATTCNGGATC CTCCCCGAAA CCAGACGAGC TACCTAAGAA 60 CAGCTAAAAG AGCACACCCGTCTATGTAGC AAAGTAGTGG GAAGATTTAT AGGTAGAGGC 120 GACAAACCTA CCGAGCCTGGTGATAGCTGG CTGCCCAAGA TAGAATCTTA GNNCAACTTT 180 AAATTTGCCC ACAGAACCCTCTAAATCCCC TTGTAAATTT ANCTGTTAGT CCAAAGAGGA 240 ACAGCTCTTT GGACACTAGGAAAAAACCTT GTAGAGAGAG TAAAANATTT AACACCCATA 300 GTAGGCCTAA AAGCAGCCACCAATTAAGAA AGCGTTCAAG CTCAACACCC ACTACCTAAA 360 AAATCCCAAA CATATAACTGAACTCCTCAC ACCCAATTGG ACCAATCTAT CACCCTATAG 420 GAGAACTAAT GTTAGTATAAGTAACATGAA AACATTCTCC TCCGCATAAC CCTGCGAGGA 480 TCCAGAATTC TCGAGTTAGTAGTAGTAGT 509 1834 base pairs nucleic acid double linear cDNA to mRNA NONO A76con.seq 90 GGCAGCCAGC GCAGGGGCTT CTGCTGAGGG GGCAGGCGGA GCTTGAGGAAACCNCAGATA 60 AGTTTTTTTC TCTTTGAAAG ATAGAGATTA ATACAACTAC TTAAAAAATATAGTCAATAG 120 GTTACTAAGA TATTGCTTAG CGTTAAGTTT TTAACGTAAT TTTAATAGCTTAAGATTTTA 180 AGAGAAAATA TGAAGACTTA GAAGAGTAGC ATGAGGAAGG AAAAGATAAAAGGGTTTCTA 240 AAACATGACG GAGGTTTAGA TGAAGCTTCT TCATGGAGTA AAAATGTATTTAAAAGAAAA 300 TTGAGAGAAA GGACTACAGA GCCCCGAATT AATACCCAAT AGAAGGGCAATGCTTTTAGA 360 TTAAAATGAG GGGTGACTTA AACAGCTTAA AGTTTAGTTT AAAAGTTGTAGGGTGATTCA 420 CATAATTTGN AGGCGATCCT TTTCAAAAGA GATTAAACCG AAGGTGATTAAAAGACCTTG 480 TAATCCATGA CCCAGGGAGA ATTCCGTCAT TTAAACCCTA GTTAACGCATTNTCTAAACC 540 CAGGCGAANC TGGAAAGATT AATTGGGAGC TGGTAGGATG AAACAATTTGGAGAAGATAG 600 AAGTTTGAAG TGGCAAACTG GAAGACAGAA GTACGGGAAG GCGAAGAAAAGAATAGAGAA 660 GATAGGGAAA TTAGAAGATA AAAACATACT TTTAGAAGAA AAAAGATAAATTTAAACCTG 720 AAAAGTAGGA AGCAGAAGAA AAAAGACAAG CTAGGAAACA AAAAGCTAAGGGCAAAATGT 780 ACAAACTTAG AAGAAAATTG GAAGATAGAA ACAAGATAGA AAATGAAAATATTGTCAAGA 840 GTTTCAGATA GAAAATGAAA AACAAGCTAA GACAAGTATT GGAGAAGTATAGAAGATAGA 900 AAAATATAAA GCCAAAAATT GGATAAAATA GCACTGAAAA AATGAGGAAATTATTGGTAA 960 CCAATTTATT TTAAAAGCCC ATCAATTTAA TTTCTGGTGG TGCAGAAGTTAGAAGGTAAA 1020 GCTTGAGAAG ATGAGGGTGT TTACGTAGAC CAGAACCAAT TTAGAAGAATACTTGAAGCT 1080 AGAAGGGGAA GTTGGGTTAA AAATCACATC AAAAAGCTAC TAAAAGGACTGGTGTAATTT 1140 AAAAAAACTA AGCAGAAGGC TTTTGGAAGA GTTAGAAGAA TTTGGAGGCCTTAAATATAG 1200 TAGCTTAGTT TGAAAATGTG AAGGACTTTC GTACCGGAAG TAATTCAAGATCAAGAGTAA 1260 TTACCAACTT AATGTTTTTG CATTGGACTT TGAGTTAAGA TTATTTTTTAAATCCTGAGG 1320 ACTAGCATTA ATTGACAGCT GACCCAGGTG CTACACAGAA GTGGATTCAGTGAATCTAGG 1380 AAGACAGCAG CAGACAGGAT TCCAGGAACC AGTGTTTGAT GAAGCTAGGACTGAGGAGCA 1440 AGCGAGCAAG CAGCAGTTCG TGGTGAAGGT AGGAAAAGAG TCCAGGAGCCAGTACGATTT 1500 GGTGAAGGAA GCTAGGAAGA AGGAAGGAGC GCTAACGATT TGGTGGTGAAGCTAGGAAAA 1560 AGGATTCCAG GAAGGAGCGA GTGCAATTTG GTGATGAAGG TAGCAGGCGGCTTGGCTTGG 1620 CAACCACACG GAGGAGGCGA GCAGCCGTTG TGCGTAGAGG ATCCAAGGCCACCATCCCAT 1680 TGTCCCAAGG CCACAGGGAA AGCGAGTGGN TGGTAAANAT CCGTGAGGTCGGCAATATGT 1740 TGTTTTTCTG GAACTTACTT ATGGTAACCT TTTATTTATT TTCTAATATAATGGGGGAGT 1800 TTCGTACTGA GGTGTAAAGG GATTTATATG GGGG 1834 1666 basepairs nucleic acid double linear cDNA to mRNA NO NO E105con.seq 91CTACTACTAC TAACTCGAGA ATTCTGGATC CTCTGTAGCT TTTTTTTTTT ACAGACTNCA 60CAGAGGAATG CAGTTGTCTT GACTTCAGGT CTGTCTGTTC TGTTGGCAAN TAAATGCAGA 120ACTGTTCTNA TCCCGCTGCT ATTAGAATGC ATTGTGAAAC GACTGGAGTA TGATTAAAAG 180TTGTGTTCCC CAATGCTTGG AGTAGTGATT GTTGAAGGAA AAAATCCAGC TGAGTGATAA 240AGGCTGAGTG TCGAGGAAAT TTCTGCAGTT TTAAGCAGTC GTATTTGTGA TTGAAGCTGA 300GTACATTTTG CTGGTGTATT TTAAGGTAAA ACGCTTTTTG TTCATTTCTG GTGGTGAGAG 360GGGACTTGAA GCCTTAAGTC TTTTCCAGAT GCAACCTTAA AATCAGTGAC AAGAACAATT 420CCAACCAAGC AACAGTCTTC AAGAAATTAA ACTGGCAAGT GGAATGTTTA ACAGTTCAGT 480GTCCTTTAGT GCATTGTTTA TGTGTGGGTT TCCTCTCTCC CCTCCCTTGG TCTTAATTCC 540TTACATGCAG GAACACTCAG CAGACACACG TATGCGAAGG GCCAGAGAAG CCAGACCCAG 600TAAGAAAAAA TAGCCTATTT ACTTTAAATA AACCAAACAT TCCATTTTAA ATGTGGGGAT 660TGGGAACCAC TAGTTCTTTC AGATGGTATT CTTCAGACTA TAGAAGGAGC TTCCAGTTGA 720ATTCACCAGT GGCCAAAATG AGGAAAACAG GTGAACAAGC TTTTTCTGTA TTTACATACA 780AAGTCAGATC AGTTATGGGG AGGATCCAGA ATTCTCGAGT TAGTAGTAGT AGTCTACTAC 840TACTAACTCG AGAATTCTGG ATCCTCTGTA GCTTTTTTTT TTTACAGACT NCACAGAGGA 900ATGCAGTTGT CTTGACTTCA GGTCTGTCTG TTCTGTTGGC AANTAAATGC AGAACTGTTC 960TNATCCCGCT GCTATTAGAA TGCATTGTGA AACGACTGGA GTATGATTAA AAGTTGTGTT 1020CCCCAATGCT TGGAGTAGTG ATTGTTGAAG GAAAAAATCC AGCTGAGTGA TAAAGGCTGA 1080GTGTCGAGGA AATTTCTGCA GTTTTAAGCA GTCGTATTTG TGATTGAAGC TGAGTACATT 1140TTGCTGGTGT ATTTTAAGGT AAAACGCTTT TTGTTCATTT CTGGTGGTGA GAGGGGACTT 1200GAAGCCTTAA GTCTTTTCCA GATGCAACCT TAAAATCAGT GACAAGAACA ATTCCAACCA 1260AGCAACAGTC TTCAAGAAAT TAAACTGGCA AGTGGAATGT TTAACAGTTC AGTGTCCTTT 1320AGTGCATTGT TTATGTGTGG GTTTCCTCTC TCCCCTCCCT TGGTCTTAAT TCCTTACATG 1380CAGGAACACT CAGCAGACAC ACGTATGCGA AGGGCCAGAG AAGCCAGACC CAGTAAGAAA 1440AAATAGCCTA TTTACTTTAA ATAAACCAAA CATTCCATTT TAAATGTGGG GATTGGGAAC 1500CACTAGTTCT TTCAGATGGT ATTCTTCAGA CTATAGAAGG AGCTTCCAGT TGAATTCACC 1560AGTGGCCAAA ATGAGGAAAA CAGGTGAACA AGCTTTTTCT GTATTTACAT ACAAAGTCAG 1620ATCAGTTATG GGGAGGATCC AGAATTCTCG AGTTAGTAGT AGTAGT 1666 1033 base pairsnucleic acid double linear cDNA to mRNA NO NO G180con.seq 92 GGATCCTCTAGAGCGGCCGC CTACTACTAC TACTCGAGAA TTCTGGATCC TCGGCCTGTG 60 GAGGATAGGGAAATGGAACA GTCCTGGCAG TTCAGGTGGC TGGGGGCTGG GACCTTTGGA 120 CAGGCTGTATATTAAGCAGG TTTCACGCGA CGCCTCCTCT TCTGAGCATT CCCCATGGCC 180 TCGCTAGTGCAGGATGGACA GGAACTGTAC TCAGGTCCAG GTGACAGCTC ATGCCTGGAT 240 TTTCCCAAGGGGGCGGATCC AACCAGTCCA CCAGTCCGTT CTGGCAGCTT TAAGCTCCTC 300 TCAGACAGGATGACATCCAA CTTGTTTCAA GGGCTTTCCA GTGGGCTGCA CTTGTTCTGT 360 CCCACAGGATCTATAGGGAA ACCCCAGCTC TGGGCAAACA GACCCCCACC CCCTGACTAA 420 GGCCTCCAGAATCTGGGCCA GAGGCCAGGG TGGGGCAAGA CACTGAGCCA GGACAGCGGT 480 TTTCTGGCTTCCTTAGTTTG TGTCCACAGA CATCCTCACT ATCCTAGGAG ATGACCCCAG 540 CAGGAATGGGGAGCTGAAGC CTGAAGAGTC ACTGAAATGA TTTACATAAT TTCCTTAATG 600 CTCACCACGATCATGTGAGA GAGAGAACAT GACCCGTTTC ACAGACGCAT CACTGAAGCT 660 GTGAGCAGAGAAACGACTTG TCTTGCCAGA CAGTGGCAGA TACAACACTT AACCCAAAGT 720 CTCTGAGTCTCCTCAGGTGC CCTTTCATCA CTCTTCCTGC TACTTGGCAT CAACCAGCCG 780 GGATGAGACACTGAAAGGGA TGCCAGGTCT TTGTTATGTT GCATCCAAAT GCTAGCTAGC 840 CCCTACCAGCCCATCTACCA CACCCGGGCT GCCTCTCATA TCTAGTATCT CAGCCCTCCA 900 GACCCTCACTCCTCCTTGAG ACTCTAGCCC CCAGTCCCCA GTTCCTCCCA GACTTCCCAG 960 ACTCTTCACAATCACCAGTG TGGGAGGATC CAGAATTCTC GAGTAGTAGT AGTAGTAGTC 1020 GACCCGGGAATTC 1033 1950 base pairs nucleic acid double linear cDNA to mRNA NO NOG310con.seq 93 CTCTGTCATC CAGGCTGGAT TGCAGTGGCA CCATCACAGC TCACTGCAGCGTCAACCTCC 60 TGGGCTCGGG TGATCCTCCC ATCTCAGTCT CCTGGGTAGC TGGCACTATAGGCATGTGCC 120 ACCACGCCAG GCTAATTTTT GTATTTTTTG TAGAGATGGG ATTTCTCCATGTTTCCTAGG 180 CTGGTCTCAA ACTTCTGGGC TCAAGCAATC TGCCTATGTT GGCCTCCCAAAGTGCTGGGA 240 TTACAGGTGT GTGCCACTGC ACCCGGCAAC TTACATTTTT AAAAGATCTCTAGCTTTTGT 300 GTGGGCACAG ATTAGGTTGT AATGTTCGAC CAGAGAAACA AGTTAGGATGCTATTGCTCC 360 ATGGTGAGTG ACATGGTTAT ACAGGGTGAA TGGTGCAGGG TGGGCTGGAGGAGAAGACAG 420 AATCCTACAG TGCAGGGCAT TGTAGTGGGC ATCTGATCTC TCTCTTCTCCCACCTCTATG 480 CAGCTGCTTC TCTCTCCTCA GAATCCAGAC CCAAATTTTA CCTTCTGCTGGGAAAGCCTT 540 CCTTCCCTAT TTTTTGTTTG CAGGTGGCGG GGGCTCCCTG GACCTGGGATTCCCACGTTC 600 TTCCTCCTAA CTTGCTGCCT CGTGGCCCTA GACCCCTCTT GTGTAACACAGACATCAGTC 660 AGGCTCTCTC AGGCTCCTAA GACCTGGACG ACAGGCTCAA GCTCCTATTTGCTCACGTGC 720 AAGTGGAAAG CTTTTGCCAG GGTGTTTGCA AGTTCCCTTG TGCATGACTGTGCATGACTA 780 GCACTGACTC TCTCCTGATA CAGCATGGTT AGATCTGTGT GTGGCTCATCAGGACATTCA 840 ACAAGTAATG CCCCTGTTCT GCACCCCACA GAAGGCAGTC CTTTCCACTGAGTCCCATTC 900 ACACAGCCAA GCTGACCATC ACCCGGATCT GCCTGTGGCA GAAGCAACTTCAAAGTGAGC 960 GCTAGTGCTC CTATTCTTGA AGTCCTGTGG TCACGCTACA GTGATAGAACTTCTTCTTCT 1020 TCACCCCCTT TCCATTCTGT CTGCAGCTTT GTGCCATCTT GCCAGTTCCCCCTCTCTCTT 1080 CACCCAATTG CAGTTTATTT CTAATACACA GAGCAATTTC TGTAGCCCTTTTGTAACAAT 1140 TCATTGCTCA CCTATGGACC CAAGATCTCA GCTTCCTACC TCCCTCTAGTGGCTGATGCA 1200 GGTATTTCCA AAAAAAAAGT CCTAGAGCAG GATCCTGGCT GGCCACACGGCTGTCCAGTG 1260 CTGCTCCTGC CCACAAGGTT CTAAGAGGTT AAGGCTTGAC ATATCAGAAAAGGAAAGGAA 1320 GCCTGTGTGA CACAGAAGCC TGGGTTGAGG GAGGCTACGC TCTGTGTACTGTCCCCGGGC 1380 AGAGGCGGTT TTCTGGGTCA CCTGCATGTC CCAACACCGG CCTCTGGTGGTCGGCAGATG 1440 TTAATCCTAA AACCCTTCTG TCCCCACCTC AGAGGTGAAG TACCTGTGCACTAGCCTTCC 1500 CCGTCTGGGT CCCCCAAGGC CCCCACACTG GGCGCACAGG GTACAGGGAGGAGCCAAGCC 1560 CTCTGCTCCA GTTCTGCCTT CTGCGCAGGA GCCCTTTGAC TTCTGGGAGTCAACCCCAGC 1620 TCACCCAACA AGGAGATAGG GCAGGTGGGA GACACCCTAA GCTCAGAAGGCCTACAGGAG 1680 ATGGAGAGCA CCCATCCTCC ACCTCTACTC CTTCTCCAGA CCACTCCACACCTCGCAGCT 1740 TCTTGCTCCT CACCCTCGCA TTTGGCCCAG TGGGCACCAA GAACAAGCCAGGGTGACTGG 1800 CTAAGCTGGG GCCAAACTCA CTGACAGAAT TGGAATTGTG TCAAAACACCACTTTTATGT 1860 CCTCACCTTT CAGGCCTGCA TCAGTGTGAG CTCTGCAGAG AAAGGGGCCTGTCTTACTGA 1920 ACCCTCAGAT CCCAGCACGC TGCTGTCCTA 1950 1328 base pairsnucleic acid double linear cDNA to mRNA NO NO G326con.seq 94 GTTTCCAAGATGATTGTAGA ACTAAAATGA GTTGTAAGCT CCCCTGGAAG AAGGGATGTG 60 GAACCTGTAACTAGGTTCCT GCCCAGCCTG TGAGAAGAAT TTGGCAGATC ATCTCATTGC 120 CAGTATAGAGAGGAAGCCAG AAACCCTCTC TGCCAAGGCC TGCAGGGGTT CTTACCACCT 180 GACCCTGCACCATAACAAAA GGACAGAGAG ACATGGTAGG GCAGTCCCAT TAGAAAGACT 240 GAGTTCCGTATTCCCGGGGC AGGGCAGCAC CAGGCCGCAC AACATCCATT CTGCCTGCTT 300 ATGGCTATCAGTAGCATCAC TAGAGATTCT TCTGTTTGAG AAAACTTCTC TCAAGGATCC 360 AGAAAATATGCTCTTTAAAA TATTTTAAAA CTGATATAGA CCCAAAGGAG AGACCCAGTA 420 ACAATATTCAGCTATATTAT CCATTCTCTC TTTCTTTCAT TCAACAAATC TGTATTGATC 480 ACAGGCTCTCTGCTGGGTGT GGGATGCAGC TGTGGGCCTG TGCTGGAGGT CCTTAGAGGC 540 CAGTACTCCTATCCTGGGCT TTATCTGCAT GGATTGCTGC AGTGTTGGGC TCCACTGCTG 600 TGTGAAGCAATTGCTCCTGC TCTTTCTGGG CATGGGAGAA GGGTCAGAGC AGTCGGACAC 660 AGATTCCCAGGCAGGAGAAT GGAACTCCTT CCGAGGAAGA AGACGTGTTT TCCTTCCAGC 720 ACACACCCAGGCATGGTGGT CAGGACCGTG GACCAGGTCC CCATCTTGTG CATGCACCAA 780 GCCCCAGGATCAGGAGCAGA GCTAGTGAGG GAGCAAGATG GATGAGGACA GCACGGTGCT 840 GACCACTCTAGACAGACAGG AGACAGGAAA CAGGATCTCA CTTGCAAAAA GACTGATCTC 900 AACTTGATCAATTAGGCAGA TACTTGAGTT CCAGTATACT CCAGGACTAT TCTAGGGGCT 960 AGGATTCAACAGTGAATAAA ACAGACAAAA TCCTTTCCCT TGTACACTTA TATCTTCTCA 1020 AAAAAGCTCCTTTCCCCTCT TTCTTATCAG GGTCTAATAT AGTTAATAAG GACTTAAGAC 1080 TGGAATATCACATCTAAATC CCCAATAATG AGCCCTCACC AATCTGCCAG GTCCCAGAGA 1140 AGCTAAAAACAATCAGGGCT GTTTGCAGCT AACTGAAATA AAACTTGATT CGAACTCATG 1200 TCAAGCCTGTTGACAACACA CACACATGTC CACGTGTCAC TGCTGTGCAT AGAAACCTCT 1260 GACTCACTACCATCTGAAGT CCAGGCTCCT TCACAGGTCA TTCAAGGTCG ACCTCTGCCC 1320 CCTCTGAC1328 1093 base pairs nucleic acid double linear cDNA to mRNA NO NOG164con.seq 95 GTGCTCAATA AATATTTTTG GAAAGAATAA ATCTTCAATC AATCCTATTCAGTAGGTTTG 60 ATGATCACTT CCAATCTATA GAAAGGAAAA CTGACTCCTA GAAAGATTAATTAACTTGCC 120 CAATGGCAGG TAGCAGTAGA AGCAGAACTT AAAACCAGGT AGCCTGACTTTAGCATCTTA 180 ACACTGGGTT GTTTTGCTTC TACTACTTGC ACTGAAGGCA CTTACCACAATTTATAGTTG 240 TTTTATTTGT TTGTTGTCTG ACCCTCCTAA ATGTGTATGA TGCTGCTAGAGCAGGGCTAT 300 GTCCTGCTCC CTGCTGTGCC ACCAATACTT AGAACAGTGC CTGGCACATTGCAGCTGTGT 360 GAGTATTTGC TGAGTGAATG AATAAACAAC CCAAATGAAC AGACAAGTGAGGGATGACTG 420 TGGAGGAATA GGGGGTGCCA GTGTGGCAGT TTCCCAGGCC CCAGCTGGATCCCAGTGCCC 480 AGTCCAGCTG TACCCACGTA AAGGGATCTG CCAAGAGGTG GCTTTTCGCTGTTGCAGAAG 540 GCATCTCTTG GGGCTGATGA CGGTGAGTCT CTCATTCTTA ACAGCAAGAGTCACCCTGCT 600 CCATGAATCT TCAAATTTGG GGTCATTTCC CACCTAAAGG CAGAGATTTGGCCTATGTTC 660 CCAACCACAG CTGAGAGTCC AACCTGCCCC TCGGGTGACA CACATGGCTCTGGGTAGGAT 720 CCGTGTATAC TGCCTCGATT CTACTCATTA CATTATGTCA GCACCTTTTTCAGCTTCTGA 780 GAAACAGGAA GCATCATGAT GTGTGGTGGG GCTTGAAGAA GATGATAAGAGACATAATCA 840 CATTTCTTTG GTTGGGGCAC AGAGGGCTGG GGTTCCTGTT TGCTCTGACTCTAAAGTGTC 900 ACCTTTTCCC TTAAGCCAGA ATGTTGGAGG ATGAGGACTA TTTAGACAACCTGCTTTCAA 960 GGGGAAAGAA AAGAGCAGGG ATCAGAGCCT TTAAAATTAT TATTATGAAACATCATACAT 1020 ACAAAAAAAT TACAATCTCT ATGTATAGTT GTTAAAACAT AACAAAACCCATGTGTCCAC 1080 ACCTGGATCC TGG 1093 2104 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G65.seq 96 GAATTCTGGA TCCTCGACCG GGCGACGCCGCGGGAGGTTC TGGAAACGCC CGGAGCTGCG 60 AGTGTCCAGA CACTTCCCTC TGTGACCATGAAACTCTGGG TGTCTGCATT GCTGATGGCC 120 TGGTTTGGTG TCCTGAGCTG TGTGCAGGCCGAATTCTTCA CCTCTATTGG GCACATGACT 180 GACCTGATTT ATGCAGAGAA AGAGCTGGTGCAGTCTCTGA AAGAGTACAT CCTTGTGGAG 240 GAAGCCAAGC TTTCCAAGAT TAAGAGCTGGGCCAACAAAA TGGAAGCCTT GACTAGCAAG 300 TCAGCTGCTG ATGCTGAGGG CTACCTGGCTCACCCTGTGA ATGCCTACAA ACTGGTGAAG 360 CGGCTAAACA CAGACTGGCC TGCGCTGGAGGACCTTGTCC TGCAGGACTC AGCTGCAGGT 420 TTTATCGCCA ACCTCTCTGT GCAGCGGCAGTTCTTCCCCA CTGATGAGGA CGAGATAGGA 480 GCTGCCAAAG CCCTGATGAG ACTTCAGGACACATACAGGC TGGACCCAGG CACAATTTCC 540 AGAGGGGAAC TTCCAGGAAC CAAGTACCAGGCAATGCTGA GTGTGGATGA CTGCTTTGGG 600 ATGGGCCGCT CGGCCTACAA TGAAGGGGACTATTATCATA CGGTGTTGTG GATGGAGCAG 660 GTGCTAAAGC AGCTTGATGC CGGGGAGGAGGCCACCACAA CCAAGTCACA GGTGCTGGAC 720 TACCTCAGCT ATGCTGTCTT CCAGTTGGGTGATCTGCACC GTGCCCTGGA GCTCACCCGC 780 CGCCTGCTCT CCCTTGACCC AAGCCACGAACGAGCTGGAG GGAATCTGCG GTACTTTGAG 840 CAGTTATTGG AGGAAGAGAG AGAAAAAACGTTAACAAATC AGACAGAAGC TGAGCTAGCA 900 ACCCCAGAAG GCATCTATGA GAGGCCTGTGGACTACCTGC CTGAGAGGGA TGTTTACGAG 960 AGCCTCTGTC GTGGGGAGGG TGTCAAACTGACACCCCGTA GACAGAAGAG GCTTTTCTGT 1020 AGGTACCACC ATGGCAACAG GGCCCCACAGCTGCTCATTG CCCCCTTCAA AGAGGAGGAC 1080 GAGTGGGACA GCCCGCACAT CGTCAGGTACTACGATGTCA TGTCTGATGA GGAAATCGAG 1140 AGGATCAAGG AGATCGCAAA ACCTAAACTTGCACGAGCCA CCGTTCGTGA TCCCAAGACA 1200 GGAGTCCTCA CTGTCGCCAG CTACCGGGTTTCCAAAAGCT CCTGGCTAGA GGAAGATGAT 1260 GACCCTGTTG TGGCCCGAGT AAATCGTCGGATGCAGCATA TCACAGGGTT AACAGTAAAG 1320 ACTGCAGAAT TGTTACAGGT TGCAAATTATGGAGTGGGAG GACAGTATGA ACCGCACTTC 1380 GACTTCTCTA GGCGACCTTT TGACAGCGGCCTTCCAACAT TAGGGCAGAG GGGAATAGTG 1440 TTAGCGACGT TTCTTAACTA CATGAGTGATGTAGAAGCTG GTGGTGCCAC CGTCTTCCCT 1500 GATCTGGGGG CTGCAATTTG GCCTAAGAAGGGTACAAAGC TGTGTTCTGG TACAACCTCT 1560 TGCGGAGCGG GGAAGGTGAC TACCGAACAAGACATGCTGC CTGCCCTGTG CTTGTGGGCT 1620 GCAAGTGGGT CTCCAATAAG TGGTTCCATGAACGAGGACA GGAGTTCTTG AGACCTTGTG 1680 GATCAACAGA AGTTGACTGA CATCCTTTTCTGTCCTTCCC CTTCCTGGTC CTTCAGCCCA 1740 TGTCAACGTG ACAGACACCT TTGTATGTTCCTTTGTATGT TCCTATCAGG CTGATTTTTG 1800 GAGAAATGAA TGTTTGTCTG GAGCAGAGGGAGACCATACT AGGGCGACTC CTGTGTGACT 1860 GAAGTCCCAG CCCTTCCATT CAGCCTGTGCCATCCCTGGC CCCAAGGCTA GGATCAAAGT 1920 GGCTGCAGCA GAGTTAGCTG TCTAGCGCCTAGCAAGGTGC CTTTGTACCT CAGGTGTTTT 1980 AGGTGTGAGA TGTTTCAGTG AACCAAAGTTCTGATACCTT GTTTACATGT TTGTTTTTAT 2040 GGCATTTCTA TCTATTGTGG CTTTACCAAAAAATAAAATG TCCCTACCTG AAAAAAAAAA 2100 AAAA 2104 1534 base pairs nucleicacid double linear cDNA to mRNA NO NO Septin-2.seq 97 CGGGGAGGCCGGTCCCGCGG GCGGGGGAAG GGGCGGTTCC GCGGCTTCTC CCGCCGCCGC 60 CGCCAAGGGGAGTTTCCAGG AAGTGGCCAT ATTGGATCCA TTCAGCCGCA GCCCGCCCGG 120 GCGGAGCGCGTCCCGCAGCC GGCTGGTCCC TGTCTCTGCC CCTGCGCTCG TCCCAGCCCA 180 CCCGCCCGGTGCGGAGCTCG CCATGGCGGC CACCGACCTG GAGCGCTTCT CGAATGCAGA 240 GCCAGAGCCCCGGAGCCTCT CCCTGGGCGG CCATGTGGGT TTCGACAGCC TCCCCGACCA 300 GCTGGTCAGCAAGTCGGTCA CTCAGGGCTT CAGCTTCAAC ATCCTCTGTG TGGGGGAGAC 360 CGGCATTGGCAAATCCACAC TGATGAACAC ACTCTTCAAC ACGACCTTCG AGACTGAGGA 420 AGCCAGTCACCATGAGGCAT GCGTGCGCCT GCGGCCCCAG ACCTATGACC TCCAGGAGAG 480 CAACGTGCAGCTCAAGCTGA CCATTGTGGA TGCCGTGGGC TTTGGGGATC AGATCAATAA 540 GGATGAGAGTTACAGGCCCA TAGTTGACTA CATCGATGCG CAGTTTGAAA ATTATCTGCA 600 GGAGGAGCTGAAGATCCGCC GCTCGCTCTT CGACTACCAT GACACAAGGT CCACGGTTTG 660 GCTCTACTTCATCACGCCCA CAGGGCACTC CCTGAAGTCT CTAGATCTAG TGGCCATGAA 720 GAAGCTAGACAGCAAGGTGA ACATTATCCC CATCATCGCC AAGGCTGACA CCATCTCCAA 780 GAGCGAGCTCCACAAGTTCA AGATCAAGAT CATGGGCGAG TTGGTCAGCA ACGGGGTCCA 840 GATCTACCAGTTCCCCACGG ATGATGAGGC TGTTGCAGAG ATTAACGTAG TCATGAATGC 900 ACATCTGCCCTTTGCCGTGG TGGGCAGCAC CGAGGAGGTG AAGGTGGGGA ACAAGCTGGT 960 CCGAGCACGGCAGTACCCCT GGGGAGTGGT GCAGGTGGAG AATGAGAATC ACTGCGACTT 1020 CGTGAAGCTGCGGGAGATGT TGATCCGGGT GAACATGGAA GACCTCCGCG AGCAGACCCA 1080 CAGCCGGCACTACGAGCTCT ACCGGCGCTG CAAGTTGGAG GAGATGGGCT TTCAGGACAG 1140 CGATGGTGACAGCCAGCCCT TCAGCCTACA AGAGACATAC GAGGCCAAGA GGAAGGAGTT 1200 CCTAAGTGAGCTGCAGAGGA AGGAGGAAGA GATGAGGCAG ATGTTTGTCA ACAAAGTGAA 1260 GGAGACAGAGCTGGAGCTGA AGGAGAAGGA AAGGGAGCTC CATGAGAAGT TTGAGCACCT 1320 GAAGCGGGTCCACCAGGAGG AGAAGCGCAA GGTGGAGGAA AAGCGCCGGG AACTGGAGGA 1380 GGAGACCAACGCCTTCAATC GCCGGAAGGC TGGGTGGGAG GCCTGCAGTC GCAGGCCTTG 1440 CACGCCACCTCGCAGCAGCC CCTGAGGAAG GACAAGGACA AGAAGAAGTA GGTGGCAGGC 1500 TGCGCCTGCGCTGGCTCCTC TTGCTCCTGT GGGC 1534 414 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G42con.seq 98 CAGGCATAAG CCACCGTGCC TGGCCTGGAATATTAGTTTT TATATAACTG GTGTAAAGGG 60 TCAAAGAGAT AATATATGTA AACACTTAGCCTGGAACCTG TCTCAAAGTA CCTACTCAAA 120 AAAATGCTAG CTGTGAAGAT GGTGATCCTGTTTAAGGAAG GGTGACTGCC TAAAAGAGAG 180 CAGAAAGTAG GACTAAAAAG GAATTATTTCAATTTGTACC ATCCATGCTG TCCACAGGAA 240 GGCAAAGAGA GAGACCTACA AAGTCTCTGTCCCCAACATG CACTCTGCCA AGTTATATAA 300 CTGTTCTGGT CTGAGACCCA TGCTTAGAGAGGGAGATTAT CCAGGAACCC AGTAGTATAA 360 CTTCTCTTTT CTTAACGAGG TCATGAAGGTAGGAGAAAGC TCCTCTGGCC TCAC 414 606 base pairs nucleic acid double linearcDNA to mRNA NO NO G105con.seq 99 TGGGAAGATA GGGATGGGAG TGGAGGGGCTGTGGGAAGGA GAAGGGTCAC TCAGGGACCT 60 GGCTGTGCCC CTTGCATCCT GACAATGGATCCACCACAAC TCTACCAGTC TGTATTAGGG 120 GAACATGAGC AAATGGCATC GTGTCTGTGCCAGTCACCAA GCACTGAGGG GAAGCTCTGG 180 AAGTTGCCGC CTGAACCTGC CCTCCAGTCTTGCAAATGCT GAGCAGGAGC CACCAGCCTT 240 GGACTGTCTG TGCTTCTTGC TAGAGCATGTGGGTCATTCC AGCCTTTCCC CAGAACGTCC 300 ATTCTCTCCA CACCTTCTTC ATTCCAAATGGGGATCCTTG CCTTTCTTTT GGACTCCAGA 360 GACATGCATA AAACCACAAC ACAGCTTTAGAAAACAAGGC ACACCTGTAT TAGTCTTACA 420 CCTAAATTGA ATGCAGCCTG CCATAAGGGAGGAATTACAG TCCTTCTAGA GGCCCAAGGT 480 ACCTGCAGCT CCCCCTGACC AGTCCTGTCAAAGCCTTGTT TTTGTCAAAA TGCCACCTTG 540 GACTCTGTCT GAGAGTTCTG CTGCCCACCAAGAGGGATGG ACAAAGTCTG TTTATCCAGA 600 AACTTG 606 421 base pairs nucleicacid double linear cDNA to mRNA NO NO G98con.seq 100 CATAATGGACCCCTCTAAGG ACTTCATGAA AGCTACGGAC CTCTCTCCAA AAAATGCTCA 60 CATGTAGTCTCTAACATTGT GCATATAATT TCGAGGGGTT TGGGATTCTC TAAGCCGTTA 120 ATGCTTCCTTGAGTTAAAAG CTTTAGAATT ATACAAATAA CCTGCTTATA AGAAATGGAT 180 CAAAACACTATTCTCCCTCC TGTCATAAAG TAAATGCCAA AACCACAGGC CACTTAGCTA 240 AGGGGCATCAGCCTTGTGGA CAAAAGAGTT CTGCTTTTCA TACCACTAGT GGCTGGTGAG 300 AGCTCCTTTCACTTTGCAGA GAGAATGCTG GTCTTCTTGG GACTACAGAG GCAGACACCG 360 TGGCACTACTACAGATCTAC AATCTAGCAC ATGTGCATGT GTGCATGATG TCAACCTCTC 420 C 421 392base pairs nucleic acid double linear cDNA to mRNA NO NO G73con.seq 101GTGACTGTGG AGGGCGAGCT GAGCCCTGGC CGCCGCCACA ATGGGCCGCG AGTTTGGGAA 60TCTGACGCGG ATGCGGCATG TGATCAGCTA CAGCTTGTCA CCGTTCGAGC AGCGCGCCTA 120TCCGCACGTC TTCACTAAAG GAATCCCCAA TGTTCTGCGC CGCATTCGGG AGTCTTTCTT 180TCGCGTGGTG CCGCAGTTTG TAGTGTTTTA TCTTATCTAC ACATGGGGGA CTGAAGAGTT 240CGAGAGATCC AAGAGGAAGA ATCCAGCTGC CTATGAAAAT GACAAATGAG CAACGCATCC 300GGATGACGGT TCCCTGTCTC TGAAAGACCT TTCTCTGGAA GAGGAGTCTG CATTGTAGTG 360TCTCAAAGAC ACAATAAACT TCCTATGGTC TG 392 2200 base pairs nucleic aciddouble linear cDNA to mRNA NO NO G89con.seq 102 CCACCCAGCC TCAGCACTCATCTGCGCAGC CATGGAGGCC CTGGGACCTG GGGGCGACCG 60 CGCCTCCCCG GCCTCGTCCACTAGCAGCCT GGACCTGTGG CATCTGTCCA TGCGCGCGGA 120 CTCGGCCTAC AGCTCTTTCTCCGCAGCCTC CGGCGGCCCC GAGCCGCGCA CGCAGTCGCC 180 GGGGACAGAC CTCCTTCCTTACCTAGACTG GGACTACGTG CGTGTGGTTT GGGGCGGCCC 240 GGGCCCCGCC CCGCCCGACGCTGCCCTTTG CACATCCCCG CGGCCCCGGC CCGCGGTTGC 300 AGCCCGCAGT GGGCCGCAGCCAACAGAGGT CCCGGGGACC CCGGGACCAC TGAACAGGCA 360 GGCCACCCCG CTGCTGTACGCGCTGGCGGC CGAGGCGGAG GCCGCGGCGC AGGCTGTCGA 420 GCCGCCCAGC CCGCCGGCCTCGAGGGCCGC CTACCGCCAG CGGCTTCAGG GCGCGCAGCG 480 GCGAGTGCTC CGGGAGACGTCGTTCCAGCG CAAGGAGCTC CGCATGAGCC TGCCCGCCCG 540 TCTGCGGCCC ACTGTCCCAGCGCGGCCCCC GGCGACTCAC CCGCGCTCCG CCTCGCTCAG 600 CCACCCGGGC GGGGAGGGGGAGCCGGCGCG CTCCCGGGCT CCCGCGCCAG GAACTGCCGG 660 CCGGGGTCCC CTCGCCAACCAGCAGCGGAA GTGGTGCTTC TCAGAGCCAG GAAAGCTGGA 720 TCGTGTGGGT CGGGGCGGTGGGCCGGCGCG GGAATGCCTG GGTGAGGCCT GCTCCAGCTC 780 TGGCCTCCCT GGGCCCGAGCCCTTGGAGTT CCAGCATCCG GCGCTGGCTA AGTTTGAAGA 840 TCACGAGGTC GGATGGCTGCCCGAGACGCA ACCCCAAGGC TCCATGAACC TGGACTCCGG 900 GTCCTTGAAG CTCGGTGATGCCTTCAGGCC CGCCAGTCGG AGTCGGAGCG CTTCAGGCGA 960 AGTCTTGGGT TCCTGGGGAGGATCAGGAGG GACCATACCC ATTGTCCAGG CTGTCCCCAA 1020 GGAGCAGAAA CCCCCAGACCATTGTTTCAG ACCAAACTTT CCAGGTTCTT GCCTCAGAAA 1080 GAGGCTGCGG TGATGTATCCTGCAGAGTTA CCCCAGAGCA GCCCTGCTGA CAGTGAACAG 1140 AGGGTCTCAG AGACCTGCATTGTGCCTGCC TGGCTCCCCT CCCTTCCTGA TGAAGTGTTC 1200 CTAGAAGAGG CCCCACTGGTCAGAATGAGA TCACCACCAG ACCCCCATGC CTCCCAGGGG 1260 CCCCCAGCCA GGTCCTATCAGTTCAGCTTC ACCCAGCTCC TGCCGGCTCC TCGGGAGGAG 1320 ACAAGGCTTG AAAACCCTGCCACCCACCCT GTGCTTGACC AGCCATGTGG GCAGGGGCTC 1380 CCTGCACCAA ACAACAGCATCCAGGGCAAG AAAGTGGAGC TGGCCGCCCG CCTCCAAAAG 1440 ATGCTTCAGG ACCTTCACACGGAGCAGGAG CGGCTGCAGG GGGAGGCACA AGCGTGGGCC 1500 AGGCGCCAAG CGGCTCTGGAGGCTGCAGTG CGCCAGGCCT GTGCCCCTCA GGAGCTGGAG 1560 CGGTTCAGCC GGTTCATGGCCGACCTAGAG CGCGTGCTTG GCCTTCTGCT GCTGCTGGGC 1620 AGTCGCCTGG CGCGCGTGCGCCGCGCCCTG GCCCGGGCGG CCTCAGACAG CGACCCTGAT 1680 GAGCAGCGAC TCCGGCTCCTGCAGCGGCAG GAGGAGGACG CCAAGGAGCT GAAGGAGCAC 1740 GTAGCGCGGC GCGAGCGGGCCGTGCGGGAG GTGCTGGTGC GAGCACTACC GGTGGAGGAG 1800 CTGCGCGTCT ATTGCGCCCTGCTGGCGGGC AAGGCCGCCG TCCTGGCCCA GCAGCGCAAC 1860 CTGGACGAGC GCATCCGCCTCCTTCAGGAC CAACTGGACG CCATCAGGGA CGACCTTGGC 1920 CATCATGCCC CGTCTCCCAGCCCGGCGCGG CCCCCAGGGA CCTGTCCTCC AGTTCAGCCG 1980 CCCTTCCCTC TTCTCCTTACATAAGATACC ACTGGGTCAG CCAGGCCTGA GGCGGGCAGT 2040 CGAGGGTGGG AGCTGAAGGGAAGCCATGTT CGGCGGTGCC CGAAACCGGC GCGCAGTCTG 2100 TCTTGAACAT CCTGCTCGGCACAAAACTTA CCCCTGAGAG CGGCTGGCGC AAACCTCAGG 2160 GCTCCTCATT GGAACAAATTGCCGTGCTGT GCATTCACAT 2200 371 base pairs nucleic acid double linearcDNA to mRNA NO NO G102.seq 103 GGGGGATCCA GCTCAGAAGC AGAGTGTCCACGCCAGGGAA TAGTGTGGGG ATTCAGAGCC 60 TGATAATGAT GAGAAGGGGA CCCACCTGAGGGTTAAGTCG GCTAGGGGGA AGTCAGATCA 120 TAGAGTAGAG ACGGCATTCT TGCGAGAAGCCACCTGGTAT AAAGTATCAG ACCGAGAAGA 180 GTGACCCTCT CAGTGACACA GATCTGGGGAGATTCAGGTC AGAGTACAGT GGGCATCCCT 240 GCAAGAGGCC ACCTGGTATC AGAGAAGGGCGGGGAATGAG GACATGATCT AGCACCAGAA 300 GTCAAAGTGT ATACAGAATG GAAAAGCATCCCATGAGGGA GTCGGAATGA AGAGTCAAGA 360 GCCTACGCAG G 371 407 base pairsnucleic acid double linear cDNA to mRNA NO NO G57.seq 104 GGCAGAGGAGGACAGGCACC TACTGCATCC AAGCCTTAGG ATATGAGTCA TGTTCCAAAG 60 GTGGGATTGGGAAGGACAAT CAGGGCGTGA TAGTAATATA TGCTGAGTAG AGGCACTAGA 120 CATGGGAAGCAACTAATTCC AACTGAAGAC CCATAGGTGT GGGAGAGAGA GGCTAGAGAG 180 GTCAGCAGGTCCTGAACATC TGCAGAAGGT GGATTGTCCT GTTGGCTCAG GGAGCTTAGG 240 CTTCAAGCCCCCTCACTTGC ATCAGCCCCT TCCAAGGCCC TGCACTTCAA TTTTACCTGG 300 TTTTTCTTAGAAAGGGCCCT CAATATTGTA AAAGCTTGAA GTCTCACAAA TCCCTGGATC 360 TGCTGCTGATGCCCTGTAAC TTGAATGAAA CCATTCACCA TTTAGGG 407 235 base pairs nucleic aciddouble linear cDNA to mRNA NO NO G108.seq 105 GTCCTCTGGG AAGCAGACACCAAAACAGAA TGAAAAGTGC AGAAGATTTA TTGGGGGTAA 60 GAAGAGCTAA TGCCTATGAAAGATAAAGGA GAAAGGAGCA GAAGTACGGA GAGAAAAGAC 120 AGCTTTCAGA CTGCAGTCCAGATCTAACTC TGGGACGCAA GAGAGGGAAG GATAATTCTG 180 TTGAAAGAGC ATCAGACTGTGATGCGGCTG TAAGAGTGTC TCAACGAGCC CAGTG 235 397 base pairs nucleic aciddouble linear cDNA to mRNA NO NO G127.seq 106 GGCAGCTGAT AAACAAACAGGGCAAGCACA TTCAGGCCAG AGCAAGGGGA AGCCCCTGAG 60 TCCCCTCTAT GTGCTCTCTGGCAAGATCTA CTTTCTGAAG CATTGACTGG AAATAGAAGT 120 CTCGCCGGGC TGGCTGGAGCCAGAGGCCCC CACACCTTAT CCCCTTTGGA ATCTGCCAGA 180 GGGCAGGTCT GAGTATGGACTTGGATGATC AACTTGGTTA ATATTCAGGC TATCTTGACA 240 GTCTCCACAC CCGTGAGCAATGTCCCAGGC AGCCTGCAGG CCTGATAGAA ACTCCACAAA 300 CCCGCCTATC ACGGAAGGTTTTCCCCTTTT GTCGGGGCCT ACCCAGACCC CAGGGGAGGT 360 GCATCCTTGA AAGCCGCTATGTGAAGTCCC ACATAGT 397 266 base pairs nucleic acid double linear cDNA tomRNA NO NO G86.seq 107 CCACCCCCTG TGGCCTTCTT TAACCATGCT GGCTAATTCAGGATCCCTAG TTCCTTATGA 60 CTTTCCTTTA AAACGTCTAC CAGAAATTGG GGGAAAAAAAGTGTTATTAT AGGATTAATG 120 TAGGTCTTCC CCACTATACT GTGAATATCA TTGAGAGCTTGGTCCCTACA CCTTAAATCC 180 CCCATCGTCA ACTATTTTTT CCCATCTCAG TGTCCCATGATCAAGGAGAC CCTCCCTGAA 240 TGTCCAGTTC CCCAACCCTT ACCCCC 266 370 basepairs nucleic acid double linear cDNA to mRNA NO NO G78.seq 108GTAGGTGATG GGATGATGGT GAAATACAGG ATCAAGTACT CAACTCCAAC CTGATGGCCA 60TACCCAGGAC AAATGCTGCC CCATAGTTGG AGATCTGGCC CATGCCTACA AGGACAAACA 120GCACGACAAA CATCTCAAAA TTCTTCGAGA AGGTCTGCAG GAAGCTGAAG CCTGTCTGCA 180CGCCCATGGT CACGAACAGC ACATTCTTCC GGCCAAACCT GGGAAGGAAA GGAGAGTGAC 240AGATAACCAG CTGGAAAAGG GCAGCAGGAA TGGGCTCCAC CAAGTGGGGC TTTCTCAAGA 300TCCATCCAGT AAGTGGGTGT GAACAGTGTT GCCAGAATAC TGGCTGCCAG GGACAGTCTC 360GGTCTCACAG 370 481 base pairs nucleic acid double linear cDNA to mRNA NONO H993.seq 109 GGGAGAAATA GCATGGGCAC TGTGAGACCG AGACTGTCCC TGGCAGCCAGTATTCTGGAA 60 ACACTGTTCA CACCCACTTA CTGGATGGAT CTTGAGAAAG CCCCACTTGGTGGAGCCCAT 120 TCCTGCTGCC CTTTTCCAGC TGGTTATCTG TCACTCTCCT TTTCTTCCCAGGTTTGGCCG 180 GAAGAATGTG CTGTTCGTGG CCATGGGCAT GCAGACAGGC TTCAGCTTCCTGCAGATCTT 240 CTCGAAGAAT TTTGAGATGT TTGTCGTGCT GTTTGTCCTT GTAGGCATGGGCCAGATCTC 300 CAACTATGTG GCAGCATTTG TCCTGGGTAT GGCCATCAGG TTGGAGTTGAGTACTTGATC 360 CTGTATTTCA CCATCATCCC ATCACCTACC TTTCTGGAGA CAGCTGTAATGTCCCTCAAG 420 GGGGACAGGG TTTCTAACAA AACTAGCCAG AGCTTCCTGG TGAACCTTACTTACAGGCAG 480 G 481 670 base pairs nucleic acid double linear cDNA tomRNA NO NO G38a.seq 110 GGGGCAAGTA GCCGGTGCGG GTGGTAGAAC TGGCAGATAAAAGGGCCTGT GGTGAGACTC 60 CAGGTGTGGT TGTATAGGGG GTTGAGGGAG GTAAGCGCGGAGGCGGGATC GAGCAGGGGT 120 CCTTGTAGCC GCCTAAGAAG TGCAGTGGTG AAGCTGACTCCTGTGAGGTG GAGGGGAGGG 180 GTCTGGAAAC AGTGGAGATA CAGCAGCCCT GGGCAGAGCAGAGGAGCCAG GTGAACCCTA 240 CCTTACAGAA ATCTTGTACC CTGGCTGAAG GACGGGCAGGGAGGGGTCGT GAGGAACCCC 300 CTCGCCGGGA TCAGGAAGCC TAGGTCAGTC CGGGTTACATAGCTGACCTG CTGTGGGACC 360 TCGGGGACCA ACACCCTCGG TTTCTGGTCC CAGGAGATGGACAAGGACGC AATGTCTGTT 420 CCTGGCCTTG GCTCAGGGCC TAATCTGATC CGCGGATGGTCCTTGCCATC AGGGAAGGGG 480 GACGCAAGAA CTCGGCGGGG GTTTGTGGTG GGGTCGCAGAGAGCAAGCCC TATATCTCCC 540 TCCGCAGACC CAGGTGCTCC CCAAACCCGG CCCGGAGCCCGCGAGAACTG GGGGCGGAGG 600 GTGTACTTAG GCGGCCCTGG GGACCTTGAC GGGACAGCTCAGCAGCAGGG GATGGGGGCT 660 CGGCGGCCGC 670 408 base pairs nucleic aciddouble linear cDNA to mRNA NO NO H90.seq 111 GTCACCAGCA CCTTGGGCTGGGTGTCAGAG AGCTCACAGA ATGTGGATAA CCAACCAGGC 60 AGATGTTGGT AACAGCAACCAGGAGGGCAC AGCACAAACC TGAGCAGGTC TTTTATGTAT 120 GTGAAGGTGA AGGAGTTATGATTTAGAAAT GGCAGTGGGA AGCAAGGAGA ATGCTGAGGG 180 CCTGCTCAGC TCTTGTCTTCCAGGATCATG GATAGTGCAA AATGAGTAGC CTTCATTTGA 240 GAGACAGAGC CATGAGGCTAGTGGAGTGCT CAGAAAGAAG CCAGATCTCT ATCAAGGAAA 300 GGAGATGGAG AGAACAACCAGGGATGTACT GAAAGGGGAG AGTTGCATGT CTCCAATGGA 360 ATATGTGTTG CAGAGGACTCAGTCACAGAG AAGACAACTC CAGGAGGG 408 254 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G66.seq 112 GGCGATCAAG AAGGATGCCC CCCAGGACAGTGACTCTGCT GGACTTCTCT ACAGAAAACA 60 GTATATCCCT CAGTGGCATG AGAAGATCCAATAGGGTCAC CACACTCCAC AACTGCAGGG 120 GACACTGTTC ACATTTTAGT CTATGCAGCCTCTGGTGGCC AAAGATTAAT ATGAGAACAC 180 CTTTGCTGTG TGACCTGAAG TTCATGGGCAGTAAATTGTA GCTATTGTTA TGCACGACTT 240 TGGGCGAACC AGGG 254 345 base pairsnucleic acid double linear cDNA to mRNA NO NO H973.seq 113 CACAGGAGGATCACATAGGG TCACCACACT CCACAACTCC AGGAGATGCT GTTAACATTT 60 TAGTCTATGGATCCTCTGGT GGCCTAAGAT TTAAATGAGA GCACTTTTGT TATGTGACCT 120 GAAGTTAATGTCAATAAGTT ATAGCTATTG TTATGTACGA TGTAAGCAGG GGTCACTGCA 180 GGCCAGAAGGCTGACACAAT TTGGCCAGGC TTTGTTCTTC AAGGAAGGGC AGGGCTCTGA 240 GAAGTGCAGACCGTGATGCA GGTGAAGGCC AGGAGGCAGG GACTCCCAGG GCAGGTCTGG 300 AAGGAGCGAGGCTGGTGACG GAAGTGGTCA GCAACCTCAA GGCGT 345 413 base pairs nucleic aciddouble linear cDNA to mRNA NO NO H505.seq 114 GGGAGATCTG CTGCTCTTTTCAGAGCCAGC AGGTAGGAAC ATTTAAGTCT GCTGAAGCTG 60 TGCCCACAGC CGCCCCTTTCCCCACGTGCT CTGTCCCAGG GAGATGGGAG TTTTATCTAT 120 AAGCCTCTGA CTTGGGCTGCTGCTGTTCTT TCAGAGGTGC CCTGCCCAGA GAGGAGGAAT 180 CTGGTGAGGC AGTCTGGCTACAGAGGCTTT GCTGAGCTGC GGTGAGCTCT GCCCAGTTCC 240 AGCTTCCTCG TGGCATTGTATACACTGTGA GGGGTAAACC ACGTACTCAA GCCTCAGTAA 300 TGGTGGATGT CCCTTCCCCCACCAAGCTTG AGTGCCCCAG GTCTACTTCA GACTGCTGTG 360 CTGGCAGCAA GAATTTCAAACCAGTGGATC TTAGCTTGCT GGGCTTCTTG GGG 413 283 base pairs nucleic aciddouble linear cDNA to mRNA NO NO H989.seq 115 GGTGCCTATG CCACCAGGGCCCTGGGTTTC AAGCATAAAA CGAAGTGGCC GGTTCACCAG 60 ACACCGAGCT AGCTACAAGAGTTTTTTTTC ATACCCCAGT GGCAGTGGAA CACCAGCGAC 120 ACAGAATCAT TTACTCCCCTGGAAAGGGGC TGAAGCCAAG AAACCAAAGG GGCTGGCTCA 180 GCGGATCCCA CTCCCATGGAGCCCAGCAAG CTAAGATCCA CTGGCTTGAA ATTCTCGCTG 240 CCAGCACAGC AGTCTGAAGTTGACCTGGGA TGCTCGAGCT TGG 283 768 base pairs nucleic acid double linearcDNA to mRNA NO NO E118con.seq 116 TGCCTGGAAT TATTATATGC TCATCACTTTATGAAGAATA AAATTTGTCT TTCCTGCCTT 60 AAAGTTACAT TCGTTCTTCC GCTCAAATCCTGATCTGGTC CATTAAAGAG TGTTCGCAGA 120 CAAAGTTTCT GAAAGATTAG AGAAGAATCCCCCCCAAGAT TGCCCCAACA CTGAACTACA 180 GACAAACACT ATTTTATTTA AATAAGGAGACAGCTTTCTA AAAGTATACA TTCTCTAATA 240 AAAATAGTTT ATTATTTTGA ATGATTTAATGGTTTTCTAC ACAATTTACA TCACAACATG 300 TAAATTTTAG CAGTAACATC TGATTCTAACAGCACATCAT GCTATTCCTT TCATAGAGCC 360 TTCAGAGATT CAATGCTAAA CAAATTTCCTTAGTTGGCAT CAAGGCACTG ATCACTTTAG 420 AGGCTTTTAA GAAATTATTT AAAGATGCAAATGCCTCTGA GTGAAGTGTA CTATCCCATC 480 ACTGAAGCCC ACAGGAACAA GTCCTACAATTTTAAAAAGG CTCGATGGAA AAATTTCTCA 540 ATCCTGAAAT CCCCTAGGGA AGGGGTCAGGAGAAAGTGCC ATGGTTGATA TTTAAGAACT 600 CCACAGCTCT TAAAAATAAG CACTTATCCCTAACATGCAA TACTGCAGAT GCAAGTTAAA 660 CTTATCTGTT AACAGCTGCC TGCTGTTTTCTGCTCCCAGA TGAAATGAAG CAACTCTTCT 720 GATAACGAAG AGATACCTGT CTGAGGCAAACGAAACATTG GCACACAG 768 493 base pairs nucleic acid double linear cDNAto mRNA NO NO E69f.seq 117 GGGTCATTTT TGCTGTCACC AGCAACGTTG CCACGACGAACATCCTTGAC AGACACATTC 60 TTGACATTGA AGCCCACATT GTCCCCAGGA AGAGCTTCACTCAAAGCTTC ATGGTGCATT 120 TCGACAGATC TCACTTCCGT TGTAACGTTG ACTGGAGCAAAGGTGACCAC CATACCGGGT 180 TTGAGAACAC CAGTCTCCAC TCGGCCAACA GGAACAGTACCAATACCACC AATTTTGTAG 240 ACATCCTGGA GAGGCAGGCG CAAGGGCTTG TCAGTTGGACGAGTTGGTGG TAGGATGCAG 300 TCCCAGAGCC TCAAGCAGGG TTGGGTTCCC ACTGGCATTGCCATCCCTTA CGGGTTGACT 360 TTCCATCCCC TTGGACCCAA GGCATTTTTA GCACTTGGGTTCCCAGCATG TTGTCACCAA 420 TCCCAACCAA GAATTTGGAA AAATTNTACT GNGTCGGGGTTGGTAGCCAA TTTCTTATGT 480 AGTGTGNTCC CTA 493 483 base pairs nucleic aciddouble linear cDNA to mRNA NO NO E69r.seq 118 CAGCCAAATT CTACTGGAGGTACAAAGAGG AGTTGGTACC ATTCCTTCGG AAACTATTCC 60 AATCAATAGA AAAAGAGAGAATCCTCCCTA ACTCATTTCA TGAGAACAGG ATCATCCTGA 120 TACTAAAGCC GGGCAGAGACACAACAAAAT NNGGAATTTT AAGCCAATAT CCCTGATGAA 180 CATCAATGCA AAAATCCTCAATAAAATACT GGCAAACGAA ATCCAGCAGC ACATCAAAAA 240 GCTTATCCAC CATGGTCAGGCCGGGTTCAT CCCTGGGATT CAAGGCTGGT TCAACATATG 300 CAAATCAATA AATGTNATCCATCACATNAA CAGAACCCAA CGNCAAAAAC CACATGATTA 360 TCTCAATAGA TTGTAGAAAAGGCCTCCGAC AAAAANTCAA CAACCCTTCA AGCTAAAANN 420 TCTCAATAAA CTATGTTTTGATGACATATT CAAAATTATA GAGTATTTGA AACCACGGCA 480 TTA 483 707 base pairsnucleic acid double linear cDNA to mRNA NO NO E36.seq 119 CGAGGACCAAACTCAGGACA CCGAGCTTGT GGAGACCAGA CCAGCAGGAG ATAGAACCTT 60 CCAGAAGTGGGCAGCTGTGG TGGTGCCTTC TGGAGAAGAG CAGAGATACA CATGCCATGT 120 ACAGCATGAGGGGCTGCCGA AGCCCCTCAC CCTGAGATGG GAGCCATCTT CNCAGTCCAC 180 CATCCCCATCGTGGGCATTG TTGCTGGACC TGGCTGTCCT AGCAGTTGTG TCATATCGGA 240 GCCTGTGGTCGCCACTGTGA TGTGTAGGAG GAAGAAGCTC AGTGGAATAA GGAGGGAGCC 300 AACTGTCAGGCTGCCGTGCC AGCGACAGTG CCCAGGGGCG CTGATGTGTC TCTCACAGCT 360 TGGGAAGCCTGAGGCAAGCT GTGCTTGTGA GGGGCTGAGA TGCAGGGATT TCTTGACGCC 420 TCCCCTTTGTGACTTCAAGA GCCTCTGGCA TCTCTTTCTG CAAAGGCACC TGAATGTGTC 480 TGCGTCCCTGTTAGCATAAT GTGAGGAGGT GGAGAGACAG CCCACCCTTG TGTCCACTGT 540 GACCCCTGTTCCCATGCTGA CCTGTGTTTC CTCCCCGTCN CTAATTAGAT GACGAGGCAT 600 TTGGCTACCTTAAGAGAGTC ATAGTTACTC CCGCCGTTTA CCCGCGCTTC ATTGAATTTC 660 TTCACTTTGACATTCAGAGC ACTGGGCAGA AATCACATCG CCTCAAC 707 324 base pairs nucleic aciddouble linear cDNA to mRNA NO NO A104f.seq 120 CATGAAGTCA CTGAGCCTGCTCCACCTCTT TCCTCTCCCA AGAGCTAAAA GAGAGCAAGG 60 AGGAAACAAC AGCAGCTCCAACCAGGGCAG CCTTCCTGAG AAGATGCAAC CAATCCTGCT 120 TCTGCTGGCC TTCCTCCTGCTGCCCAGGGC AGATGCAGGG GAGATCATCG GGGGACATGA 180 GGCCAAGCCC CACTCCCGCCCCTACATGGG TTATCTTATG ATCTGGGATC AGAAGTCTCT 240 GAAGAGGTGC GGTGGCTTCCTGATACAAGA CGACTTCGTG CTGACAGNTG CTCACTGTTG 300 GGGAAGCTCC ATAAATGTCACCTA 324 387 base pairs nucleic acid double linear cDNA to mRNA NO NOA104r.seq 121 ACTACTACTA CTACTAACTC GAGAATTCTG GATCCTCGGC TTAGTTTGCTTCCTGTAGTT 60 AGTAGCGTTT CATGGTTTTC TTTATCCAGT GTACAAAGCT TGAGACTTTGGTGCAGGCTC 120 GTGGAGGCAT GCCATTGTTT CGTCCATAGG AGACAATGCC CTGGGCCACCTTTGTTACAC 180 ACAAGGAGGG CGCTCCAGAG TCCCCCTTAA AGGAAGTCTT TTNAATCTCTGGGTCCCCCA 240 CGCACAACTC AAGGGTACTG TCGNAATAAT GGCGTAAGTC AGATTCGCACTTTTCGATCT 300 NCCTGCACTG TCATCTTCAC CTCTAGGTAG TGTGTGTGAG TTGTGATGCCCAGGGGGGGN 360 CCNNCTGNCC CCAGNCGGGN CANACTN 387 562 base pairs nucleicacid double linear cDNA to mRNA NO NO H622.seq 122 GGAACGTCTG AGGTTATCAATAAGCTCCTA GTCCAGACGC CATGGGTCAT TTCACAGAGG 60 AGGACAAGGC TACTATCACAAGCCTGTGGG GCAAGGTGAA TGTGGAAGAT GCTGGAGGAG 120 AAACCCTGGG AAGGCTCCTGGTTGTCTACC CATGGACCAG AGGTTCTTTG ACAGATTTGG 180 CAACCTGTCC TCTGCCTCTGCCATCATGGG TAACCCAAAG TCAAGGCACA TGGCAAGAGG 240 GTGCTGACTT CCTTGGGAGATGCCATAAAG CACCTGGATG ATCTCAAGGG CACCTTTGCC 300 CAGCTGAGTG AACTGCGCTGTGACAACCTG CATGTGGATC CTGAGAACTT CAAGCTCCTG 360 GGAAATGTGC TGGTGGCCGTTTTGGCAATC CATTTCGGCA AAGAATTCAC CCCTGAGGTG 420 CAGTCTTCCT GGCAGAAGATGGTGACTGGA GTGGCCAGTG CCCTGTCCTC CAGATACCAC 480 TGAGCTCACT GCCCATGATGCAGAGCTTTC AAGGATAGGC TTTATTCTGC AAGCAATCAA 540 ATAATAAATC TATTCTGCTA AG562 692 base pairs nucleic acid double linear cDNA to mRNA NO NOG61con.seq 123 GTGGGAGGTA GCCACCTGTT CTGGGCTGTG TCTGTCCTGC TGCCTGCTGGAGAGGCCAGC 60 AATGAGTCCT GGGCCAGCCC AGATCTCACC TGTGTGTTGA ATACTAAACAAGGAGACAAG 120 TAAAATAAGT CCAGCATGAG TCAGATGCTA GGTCTTGGCT TGGGGAAGCATGCCCTCAAG 180 TGCCATAAAC ACCTAGAGGA CAAATGGGAG CAGAGGATCA AGAGCTTCTGCCTGCCTGTA 240 CAGCACCTTT GGTGCAAAGT AGGAAGAAGT CTCACTCTGG GTGGATAACTTTCTTAAAGG 300 CACACCTCCC TCTAGGCTAA GGCAGCCCCA TGCCGCAGGG TCTAATCTTGTCAATCAAAA 360 TACCCACCCA TCAGTGACAA TATGAGTGGC TTCTGCAGCA TTCAGGGGAATTTTGTCAGA 420 GATAGGGAGG CCAAGATCCA AGTGGAGGAA GCCTGACTAG CAGAGTCTGTGGAAGAACTG 480 CAATGGGGGA TGAGTCTTCA GGGTCTTGTG CCTGAGCAAT GTGGGTTGTGGGAGAGGATT 540 CTGGAGAAGG TTTTATTTGG ATGGTAGAGG ATCCCTCCAT TTAGCTGCTGAGTCAAGAGG 600 AAGAGAGTGG AGTCCAGGAG GGTAGTAGGA GGTCGTTATG ATGTTATGGATAAGAATAGA 660 TGTGGTCCAA GGATGGCTTG AGTCATGGCT GG 692 289 base pairsnucleic acid double linear cDNA to mRNA NO NO G45.seq 124 CCCAAGATAGGCCGGGGCAG GGAGAAGTAG GGATGGATGG GATCCCCACA GTGTGACATA 60 GCATGGCTGTGAATGAGGTG GGTGGGCGAG GGTGAGCCCC AGAAGCCAGG ACATCTGGAC 120 TCCAGCTAAGGGTGTGGAAA CAGGCTATGA AGATCGCCAG GGAGAAGTGA CTCATAGTTG 180 TCCCCACCTGACTACTCCGT TGACTACACT CCTGGTGCTG GGAAAGGCCT CCCTGCCATC 240 CAGTCTTTCTCTCCTCTCTC CACTCTGCAG GAACAACTCC AGCCTCTAC 289 1200 base pairs nucleicacid double linear cDNA to mRNA NO NO G3con.seq 125 ATGTATGTTTGGCTCTGCTT TTAACTTTAT AAATCCAGTG ACCTCTCTCT CTGGGACTTG 60 GTTTCCCCAACTAAAATTTG AAGTAGTTGA ATGGGGTCTC AAAGTTTGAC AGGAACCTTA 120 AGTAATCATCTAAGTCAGTA CCCACCACCT TCTTCTCCTA CATATCCCTT CCAGATGGTC 180 ATCCAGACTCAGAGCTCTCT CTACAGAGAG GGAAATCTCC ACTGTTGCAC ACCCACCTTT 240 GGAAAGCTCTGACCACTTGA GGCCTGATCT GCCCATCGTG AAGAAGCCTG TAACACTCCT 300 CTGCGTCTATCCTGTGTAGC ATACTGGCTT CACCATCAAT CCTGATTCCT CTCTAAGTGG 360 GCATTGCCATGTGGAAGGCA AGCCAGGCTC ACTCACAGAG TCAAGGCCTG CTCCCTGTAG 420 GGTCCAACCAGACCTGGAAG AACAGGCCTC TCCATTTGCT CTTCAGATGC CACTTCTAAG 480 AAAAGCCTAATCACAGTTTT TCCTGGAATT GCCAGCTGAC ATCTTGAATC CTTCCATTCC 540 ACACAGAATGCAACCAAGTC ACACGCTTTT GAATTATGCT TTGTAGAGTT TTGTCATTCA 600 GAGTCAGCCAGGACCATACC GGGTCTTGAT TCAGTCACAT GGCATGGTTT TGTGCCATCT 660 GTAGCTATAATGAGCATGTT TGCCTAGACA GCTTTTCTCA ACTGGGTCCA GAAGAGAATT 720 AAGCCCTAAGGTCCTAAGGC ATCTATCTGT GCTAGGTTAA ATGGTTGGCC CCAAAGATAG 780 ACAGGTCCTGATTTCTAGAA CCCGTGACTG TTACTTTATA CAGCAAAGGG AAACTTTGCA 840 GATGTGATTAAAGCTAAGGA CCTTAAGACA GAGTATCCTG GGGGTGGTGG TGGGGTGGGG 900 GGGGGTCCTAAATGTAATCA CGAGTAAGAT TAAGAGCAAA TCAATTCTAG TCATATATTA 960 AACATCCACAATAACCAAGA TATTTTTATC CCAAGAATGC AAGATTTCAG AAAATGAAAA 1020 ATCTGTTGATAAATCCATCA CTATAATAAA ACCGAAGGTG AAAAAAATTC TGAAAAAATT 1080 CTAGCAGCTATATTTGATAA AATTCAACAT CTCCTAGCTT TAGCAAACTC ACAGTTTTGC 1140 AAATAATATTTTCTTAATGT TATCTGTTGC TAAATCAAAA TTAAACAGTC ATCTTAACTG 1200 319 basepairs nucleic acid double linear cDNA to mRNA NO NO G30.seq 126ACGTGGTATG AAGTGTAAAG TTCTACTTTT AATTTTTTGC ATATTTTATT AGGATAGGAT 60GGGCTTTTTC TGTAGTAATA ATCCCTAAAT CTCAGGGGCT TAATATATAA AATTGTCTCA 120TGCAAAAAAC CACTGGGTCT AGGGCAATTG CTATCTACTG CCGTCTAATC TCCCTCTAGT 180GGCTTCCATT GGTAGACCCT AACAGGAAGC CAGCTGATAA GGGAATCTGG GAAATGTAGT 240TTACAGAGTG GCAGCTACAG TAGAACAGTA GAGACTACAA GGATGAGCTT GCAGCTGAGA 300ATAGAAACGT GACTGGCAC 319 383 base pairs nucleic acid double linear cDNAto mRNA NO NO G32.seq 127 CAGAACTTGA ATCTCTTCTG TTAATGGCAA CTCCATTATTCCAATTGCTC AGGCAAAAAT 60 TGGCATTAAC CTTGATTCTT TATCTTACAT CTTATATCTAATTCGCCAGT TTAATACTAT 120 GGGTTCAATT TTCAAAACAT CCGGAATCTG ACCATGCCTCACCATTTAAA CCAGCAGTCC 180 CCAACCATTT TGGCACCAGA GACCGATTTA GTGGAAGACAATTTTTCCAT GGACGGGTGG 240 GGTGAGGGGG ATGGTTTCGG GATGAAACTG TTCCACCTCAGATCATCAGG CATTAGTCAG 300 ATTCTCATAA GGAGCATGCA ACCTAGATCC CTCACGTGCAAATTCACAAT AGAGTTTGCA 360 TTCCCGTGAG AATCTAATGC CAC 383 407 base pairsnucleic acid double linear cDNA to mRNA NO NO G37.seq 128 GGATAATGCAAAGGAAGACG CTGCCTGGGA ATTCACCGTC TGTGGAAATG AGTCCCAGAG 60 AGGAATAAAGCAGCCCTCAC CTTGCTCTCC CCACCCGAAC CCACTTTCCC CACCCGCCTC 120 GGCCCCCACCCCAACACCAC CATCACTCCC TTCCCTCCCT CTACTGCAAT CAGCTATTTT 180 CCATCATTCTTACCTCCCTC TCTTACACCA TTCTTCATAG AACAGCCTAT TGTATTTTTT 240 AAGAGACTGTGTTCCTCCTC CACTTCTGTT CAATGGCTTC ATATTCACTT AAATTAAAAT 300 TCAAACGTTTACCACGGCTT TCAATGACCT GAATGATGTG CCTGCTGCCC TCCTTTCCAA 360 TCTCACTTCAGGACTCCTAC CCTCTGGCTA TTAGGAGGCT GCAGCTG 407 428 base pairs nucleic aciddouble linear cDNA to mRNA NO NO G39.seq 129 GGGTGGGGTA GGGGGAAGAGGTGGACATCA AAAAGGACCT GACTCCAAGA TGATATGCAA 60 TAATTAACCA TTGGAGGGCAGAAAGAGACT AAACACTTTT TTTTTCTTTT TAATGAATAA 120 TTGCTAATAC TCTGGAGATGAAATACTTCT AACTCCAAAT CTATTTGTGC TTTACATTTT 180 ACGTTTGGGG TTAGCTTTGTAAGGTGACAA GCCACCTTAG GTATAAGAAA CAATGATTTT 240 CCCAAATGCT GACTTTATGAAAGGCCTATT ACTCCCCCAG AGTATTTATT GTTAGAAGTA 300 ATGGTTAAAA TATATGATTGCCTAGAAAGG AAGTAAAAAA TGAAAATCTG AAACCCGTGG 360 TGAAAAGAGT GAGGCAGCTGTAACCTATTC CTCAACTTCT GAGTGTTAAC AGGGCCCGTG 420 TGGGGTTG 428 435 basepairs nucleic acid double linear cDNA to mRNA NO NO G75.seq 130GAATTCTGGA TCCTCCCTCA GTGGGTCCCT CTCAAGAGGC CATTTAAAAA CCTGGACTGA 60TAGAAACAGC CAGTACTTTG TGCCTCCTGC ATCCCATGTT GGAGACAATT GCCCTAACCA 120CCCAGAGCAT TGCTCAGCCT ATAAACCCAT TTCCAAGGAT AGGGCCTGAC TTCTTTGAGG 180ATCATGAGTA TGATTTCCAG GTCTTTTCTG ACCTCATTAA TGACCTTCCT GCTATGCACT 240GGTCTCTAAA CCCCTTGGCC GTGATTGTGA TGTGGAAATA AATAGAAGGT GCTTTATTCT 300TAAGCAGAGA TTCAGTGGCA GAGGGTTTGA TTTTGGAAAA GAGAAAGGGC GCAGGATCAA 360GTGAGAATCT TGTAGAATTG TGAGGCCAGA GGAGCTTTCT CCTACCTTCA TGACCTTGTT 420GAGGATCCAG AATTC 435 373 base pairs nucleic acid double linear cDNA tomRNA NO NO H100.seq 131 CCCCATCCTG CCCGCTGCTA CCAGCTACAC CTCCGTTGCCCAGGCCCTGG GATCAAACCC 60 TGCCCACCAG CTCCCCTCGT CCAACCAGCT GCCACGTCCTGTAACCAAAA GTGACCGGGA 120 TGAATGCCTG GCTCCCCCTC CTTTCCAGCC CTAGCTCAGGCCCATCGTCC CCAGCTGATG 180 TCGCCCTGTC TGCACGATGC CTGGGCACCT ACTCCACACTCCTCACTGGC CTCAGGCCCC 240 ACCAGCCCCT GCCTCGAGCT AGCCCCTCCA CCCGTCATCACTCCTGCCAG ACTCCAGATG 300 TCCAAGGTGC TCCTTGGCTC CCACAAGCTC TCCTCCAGCACCCCATCTTC CCCTGGTTGC 360 CCCTCGGTTC CCC 373 312 base pairs nucleic aciddouble linear cDNA to mRNA NO NO H414f.seq 132 GATCCATATC TGGGGGAAGAGGATTCTATG CTTGACTGAA TATGGGATGT GAAGGAGAAG 60 AAGTTGTGGC CTCAATCTACCCAATTGGGA GACTGGTGCA TGGGCCATGG TAGTGCCAAA 120 ACATAGAGCT ATTAAGGTAAGGAATGCAGG AGGGAAGAGT AGGCATGGTG GAGAAGATAG 180 AGAAATCTAG TTGTACTTAGTAAGTTTGAG GTAGGCTGAA ATTCAGGTAA CAGTTTTCTT 240 AGTAGGCAAT TGGGGTGAGAGATTTTGGAA ATTTACCCTT TAGATCAATT TTTGGGGAGG 300 ATCCAAGAAT CT 312 503base pairs nucleic acid double linear cDNA to mRNA NO NO H631.seq 133CAGAAATGAG CAAAGGCAAA GGGGAATACA GTGGGTCTGA GGCTGGCTCA CTGTCCCATT 60CTGAGCAGAA TGCCACTGTT CCAGCTCCCA GGGTGCTGGA GTTTGACCAC TTGCCAGATC 120CTCAGGAGGG CCCAGGGTCA GATACTGGAA CGCAGCAGGA AGGAGTCCTG AAGGATCTGA 180GGACTGTGAT TCCATACCGG GAGTCTGAAA CACAAGCAGT CCCTCTTCCC CTTCCCAAGA 240GGGTAGAAAT CATTGAATAT ACCCACATAG TTACATCACC CAATCACACT GGGCCAGGGA 300GTGAAATAGC CACCAGTGAG AAGAGCGGAG AGCAAGGGCT GAGGAAAGTG AACATGGAAA 360AATCTGTCAC TGTGCTCTGC ACACTGGATG AAAATCTAAA CAGGACTCTG GACCCCAACC 420AGGTTTCTCT GCACCCCCAA GTGCTACCTC TGCCTCATTC TTCCTCCCCT GAGCACAACA 480GACCCACTGA CCATCCAACC TCC 503 398 base pairs nucleic acid double linearcDNA to mRNA NO NO G93.seq 134 CCCTCCTATC TTCTTTCTCA TCCCTGCCTCCCTCCCATGC TGACTCCTGT GTCCCTCCCT 60 TCAGTCACTC TCCTGTCAAG TGGCTCACCTCTTGGGCCTC CCCAAGGATC CCATTCTGAA 120 AACCCCACCA AGGCAATCCA GTTGACGACCTTCCTGCTCC CTCCACAGCG CAGCCCCCGA 180 GGATCACAGT CTCTGTCCCA GAGGGGCTCTCTTCCCAGAA ACTGTCAACA CATGCCCCCT 240 TTAGACTCCT CTCATCCTCA GCCAGGACTCTGACTCCCAT TCCACAAAGG AGGCAGAAGC 300 CGTCAGAGGA CTCCCCGCAT CTTCCTGGCCTCCCAGACTC TTCTCTTACC CCTTCCTTCC 360 TAGCAGGGCC ATCTCCTCCC TGGTACTTGGAGACCTCC 398 629 base pairs nucleic acid double linear cDNA to mRNA NONO G115a.seq 135 CCTTTTTTTT TTTTAGGGGA AGCCAATTTG ATGATGTAGC TTGTGACTCCCAGGGTATTG 60 TCTACCCCAM CCTGGGGGAA AACATGGGGA AGTTTCTGGG ATCTAAGCTTCAATCGCAAG 120 CATCACAGGG GATTTTGAAA GTGATGCAGC ATCGTGAAAT GAGCATAGGGGATCGTTTGG 180 AGTCGAGCGC TTGTTATTTG AAGCTCCTCC TGTCTATGTG GTGACTCAAGGAGGGGAGAA 240 TTTCCCCTTT GTGAGCCAGC TGGACAAGTG TCAGCACTGC TGTCTTTGCAGGCTCTGGCA 300 CAGAGGCGCA GGGCCTGGAC TAAAGGGAGT CTCCAGGGTT TGGGGGTCAGACCCAGCTTA 360 ACACACATGA AAAGGAGTGA TCTTTCTCTT AAGCCATTGA GCCAGGGTCCCCCTACAGCC 420 CACAGAGCCC TTTCCACCAC CTGGCGCAGC ATCTGAACCA AACAGAGGGCATGTTTGTTG 480 CACTGCCTGG GGGTTCAGCT CCCATCCATA CCACAACACA GGACAAGGCCCGGGCTTTGC 540 ACAGCACAGT CAAGTGAGCA CACTCTCACG ATCTGATGCA GTCCTTCTCCCACACCCACC 600 ATCCAATTTT TTCCGAGGAT CCAGAATTC 629 696 base pairsnucleic acid double linear cDNA to mRNA NO NO G115b.seq 136 GTGGCACATGGAAAAAGTGG TGGTGAGGGG CAAGGAACAG GGACAAAAAA GGAACCTGGG 60 TCCTTAAGAGATCACTTAGT TGTTTTACCC AAGACTTGGC RTACAAAAAA TATCAGAAAT 120 GAGTGTCTGCTGCCAAGTGG GGTCACTGCA CGTCCTAGAA AAAAAGTATG CCTTCATCTG 180 CAGTGACACACAGCCATGGT GTTGGGACTG GCACACTGCA TCTCTAAGCC GCCAAGAGCT 240 TGACCCTGGATGGGAAGAAA ACTGCCCCAG AGATGCTGGA GCTGGCTTTA TGAACTGGCT 300 TTATGCTGGGGAGGTGGATG GCAGATCGCA TCCATCTGGT AGGTTGAGGT CCTCTTGGCA 360 AGCCTACTACCATCACCTCC CAGCAGAAGA GGACTGCAAA CTATCCTTAA AGGGGACTCG 420 GTCCAGTGAGTCTTACCTTT TTTTGAGGTC ATACACCCCT TCCGGAATCT GATGGCACAC 480 ATAGACCCTTTCCCCAGGAA AATTCACAAA CATCCAGAGT TTCATATGCC ACTAGGGGAT 540 TTAAAAGACCCTGCATCAAC TGAACTCATA ACCTGGAGTC CAATTCTTAT GAGAGGGTGG 600 CACATGGAAAAAGTGGTGGT GAGGGGCAAG GAACAGGGAC AAAAAAGGAA CCTGGGTCCT 660 TAAGAGATCACTTAGTTGTT TTACCCAAGA CTTGGC 696 574 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G115c.seq 137 GGAGGCCCCA GACTGCTACT CATACAGGCAGCTGTATCTT GTCTCCAGGA GAGAGCAGGG 60 ACCCAGKATG GAGCAGACAT AGGTCTTCTGGGGACTCAGC CCTTTCGGGA GGGAGTGTGT 120 GCCCTAGGCA CACCCTTCCC ATTTGACAATCTGATATGAG GTGGAAACAG GGTCCTTGGG 180 CCCCTAAGTC ATGTTGGGAA TGTTTCCTTCTCTCAAGCCG GAAGAGCTGA GGTTTATCTG 240 AGAAATGCCT ATGTCTCTTT TGACACATCGTAGTCACTAA CCCCTTGTTC CTGCTCCAGG 300 AGCCTCTAAA AAGCCATCTA GACCAGAAAAATTGGTCTTT TTTTAGTGAT GGGAGTGGCT 360 TTAATGTCAT TCTCCCCTAT TTAGTTATGAGCTGTACCTC AGTTTTGGTC ATTAGAAATA 420 TAATTTTAGG TCAGGTGCAG CGGCTCATGCCTATAATCCC ACACTTTGGG AGGCTGAGGG 480 TGGGCAGGTT ACTTGGAGGT CAGGAGGTTAAAGACCAGCC TGGCAACATG GTGGAAACCC 540 TATCTCCACT TAAAATACAA AAATTAGTTGCATG 574 252 base pairs nucleic acid double linear cDNA to mRNA NO NOG122.seq 138 GCTCTGACTA GCCAATGAGT CTTGCTCTGA TATGGCACCT GCAAAATCTCTTTCTGGGGT 60 CTTCCACTGC CTAACTTTAG CCCCAGTAAT TTACTAGGTT CTGGCACATGGCCCATGATC 120 CTGACACCAG GCCTGCCTTT GTTTCAGCTT CACTATTCTA ATCTTTGCATTAATAGCTTG 180 TAATACCCTG GTGGCTATCA TTATATAGTG TATATGTGCA ATATCAGTATGGCTGACCTA 240 GGTCAGTCCT GT 252 278 base pairs nucleic acid doublelinear cDNA to mRNA NO NO G329f.seq 139 GTTGATCGGT GATGCCGCAA AGAATCAAGTTGCAATGAAC CCCACCAACA CAGTTTTTGA 60 TGCCAAACGT CTGATTGGAC GCAGATTTGATGATGCTGTT GTCCAGTCTA ATATGAAACA 120 TTGGCCCTTT ATGGTGGTGA ATGATGCTGGCAGGCCCAAG GTCCAAGTAG AATACAAGGG 180 AGAGACCAAA AGCTTCTATC CAGAGGAGGTGTCTTCTATG GTTCTGACAA AGATGAAGGA 240 AATTGCAGAA GTCTACCTTG GGAAGACTGTTACCAATG 278 795 base pairs nucleic acid double linear cDNA to mRNA NONO E67.seq 140 GCGCGGGGTG GACTCTTTCT GGATGTTGTA GTCAGACAGG GTGCGTCCATCTTCCAGCTG 60 TTTCCCAGCA AAGATCAACC TCTGCTGGTC AGGAGGGATG CCTTCCTTGNCTTGGGTCTT 120 TGNCTTGACA TTCTCAATGG TGTCACTCGG TTCCACTTCG AGAGTGATGGTCTTACCAGC 180 CAGGGTCTTC ACGAAGGATC TGCATCCCAC CTCTAAGACG GAGCACCAGGTGCAGGGTGG 240 GGACTCTTTT CTGGATGTTG TAGACAGACA GGGTGCGTCC ATCTTCCAGGTGTTTCCCAG 300 GAAAGGTCAA ACTCTGCTGA TCAAGAGGAT GCTCCTTGTC TGGATCTTTGCCTTGACATC 360 TCAATGGTGT CACTCGGCTC CACCTCGAGA GTGATGGTCT TACCAATCAGGGTCTTCNCG 420 GAAGATCTNC ATCCCACCTC TGAGTCGGAG CACGCAGGTG CAAGGTGGACTCTTTCTGGA 480 TGTTGTAGTC AGACAGGGTA CCGACCATCT TCCACCTGTT TTCCGGCAAAGATCAACCTC 540 TGCTGGTCAG GAGGGATCCC TTCCTTGTCT TGGAGCTTTG CCTTGACATTCTCAATGGTG 600 TCACTCGGCT CCACTTCGAG GGTGATGGTC TTACCANTNA GGGTCTTCACGAAGAACTGC 660 ATACCCCCTC TGAGANGGAC CACCAGGTGC AGGGNAGACT CTTTCTGGATGTTGTAGTCA 720 GANAGGGTGC GCCCATCTTC CAGCTGCTTT CCGGCAAAGA TCAACCTCTCCTGGTCAGGA 780 GGAATGCCTT CCTTG 795 565 base pairs nucleic acid doublelinear cDNA to mRNA NO NO E94.seq 141 GTCACTGTAG AAGTATTTTA ATGTGTCAAAACTTCCATCT GCATGTTTCT TTAATTTGCA 60 GAGGATTGAT TATTAGCTCT TTGTGCCAAATAACTGTCAC TCATTTTAAA ATCTTTCCCA 120 AACACAGGTA CTATTTCTAT TCTACATAATGGGAGAATGT GCCAGTAGGA GACTGCCTGG 180 CCAACTCTGA AAAAAATGCT TTAACAATATGCCCCAGCTA AAATCACTTT TCCTTTATTT 240 CCACAAATCA AATTCAAAAT CAAAACTCATTATGGTATAC CTTATATAAC TCGGATCATG 300 TTTATAAAAT TAGCATTCTT TGGATAGTAAAACACCAGTT AATACTTAAT TTGTTTACCC 360 ATGCACAAAA CTACCTCCCG AGATTAGACTAAGTCCCTTT AAGGATTTTA GGTCTCCATT 420 TTGAGNTGTT TTGATTTATA GAAGGATCTGAAAAAAAATC GAGGAGAAGT CGTTTTCCTC 480 CTTTGTAAAC CTTCTGCCCA GAGGCCGGCGACGNATGCAC CAGCAAGGAC AAGCCCAGTC 540 TTTTCAAGCG ACACCTGTTC GCCTG 565 420amino acids amino acid single linear protein NO NO CDC.pep 142 Met AlaAla Thr Asp Leu Glu Arg Phe Ser Asn Ala Glu Pro Glu Pro 1 5 10 15 ArgSer Leu Ser Leu Gly Gly His Val Gly Phe Asp Ser Leu Pro Asp 20 25 30 GlnLeu Val Ser Lys Ser Val Thr Gln Gly Phe Ser Phe Asn Ile Leu 35 40 45 CysVal Gly Glu Thr Gly Ile Gly Lys Ser Thr Leu Met Asn Thr Leu 50 55 60 PheAsn Thr Thr Phe Glu Thr Glu Glu Ala Ser His His Glu Ala Cys 65 70 75 80Val Arg Leu Arg Pro Gln Thr Tyr Asp Leu Gln Glu Ser Asn Val Gln 85 90 95Leu Lys Leu Thr Ile Val Asp Ala Val Gly Phe Gly Asp Gln Ile Asn 100 105110 Lys Asp Glu Ser Tyr Arg Pro Ile Val Asp Tyr Ile Asp Ala Gln Phe 115120 125 Glu Asn Tyr Leu Gln Glu Glu Leu Lys Ile Arg Arg Ser Leu Phe Asp130 135 140 Tyr His Asp Thr Arg Ser Thr Val Trp Leu Tyr Phe Ile Thr ProThr 145 150 155 160 Gly His Ser Leu Lys Ser Leu Asp Leu Val Ala Met LysLys Leu Asp 165 170 175 Ser Lys Val Asn Ile Ile Pro Ile Ile Ala Lys AlaAsp Thr Ile Ser 180 185 190 Lys Ser Glu Leu His Lys Phe Lys Ile Lys IleMet Gly Glu Leu Val 195 200 205 Ser Asn Gly Val Gln Ile Tyr Gln Phe ProThr Asp Asp Glu Ala Val 210 215 220 Ala Glu Ile Asn Val Val Met Asn AlaHis Leu Pro Phe Ala Val Val 225 230 235 240 Gly Ser Thr Glu Glu Val LysVal Gly Asn Lys Leu Val Arg Ala Arg 245 250 255 Gln Tyr Pro Trp Gly ValVal Gln Val Glu Asn Glu Asn His Cys Asp 260 265 270 Phe Val Lys Leu ArgGlu Met Leu Ile Arg Val Asn Met Glu Asp Leu 275 280 285 Arg Glu Gln ThrHis Ser Arg His Tyr Glu Leu Tyr Arg Arg Cys Lys 290 295 300 Leu Glu GluMet Gly Phe Gln Asp Ser Asp Gly Asp Ser Gln Pro Phe 305 310 315 320 SerLeu Gln Glu Thr Tyr Glu Ala Lys Arg Lys Glu Phe Leu Ser Glu 325 330 335Leu Gln Arg Lys Glu Glu Glu Met Arg Gln Met Phe Val Asn Lys Val 340 345350 Lys Glu Thr Glu Leu Glu Leu Lys Glu Lys Glu Arg Glu Leu His Glu 355360 365 Lys Phe Glu His Leu Lys Arg Val His Gln Glu Glu Lys Arg Lys Val370 375 380 Glu Glu Lys Arg Arg Glu Leu Glu Glu Glu Thr Asn Ala Phe AsnArg 385 390 395 400 Arg Lys Ala Gly Trp Glu Ala Cys Ser Arg Arg Pro CysThr Pro Pro 405 410 415 Arg Ser Ser Pro 420 321 amino acids amino acidsingle linear protein NO NO Septin-2.pep 143 Met Ala Ala Thr Asp Leu GluArg Phe Ser Asn Ala Glu Pro Glu Pro 1 5 10 15 Arg Ser Leu Ser Leu GlyGly His Val Gly Phe Asp Ser Leu Pro Asp 20 25 30 Gln Leu Val Ser Lys SerVal Thr Gln Gly Phe Ser Phe Asn Ile Leu 35 40 45 Cys Val Gly Glu Thr GlyIle Gly Lys Ser Thr Leu Met Asn Thr Leu 50 55 60 Phe Asn Thr Thr Phe GluThr Glu Glu Ala Ser His His Glu Ala Cys 65 70 75 80 Val Arg Leu Arg ProGln Thr Tyr Asp Leu Gln Glu Ser Asn Val Gln 85 90 95 Leu Lys Leu Thr IleVal Asp Ala Val Gly Phe Gly Asp Gln Ile Asn 100 105 110 Lys Asp Glu SerTyr Arg Pro Ile Val Asp Tyr Ile Asp Ala Gln Phe 115 120 125 Glu Asn TyrLeu Gln Glu Glu Leu Lys Ile Arg Arg Ser Leu Phe Asp 130 135 140 Tyr HisAsp Thr Arg Ser Thr Val Trp Leu Tyr Phe Ile Thr Pro Thr 145 150 155 160Gly His Ser Leu Lys Ser Leu Asp Leu Val Ala Met Lys Lys Leu Asp 165 170175 Ser Lys Val Asn Ile Ile Pro Ile Ile Ala Lys Ala Asp Thr Ile Ser 180185 190 Lys Ser Glu Leu His Lys Phe Lys Ile Lys Ile Met Gly Glu Leu Val195 200 205 Ser Asn Gly Val Gln Ile Tyr Gln Phe Pro Thr Asp Asp Glu AlaVal 210 215 220 Ala Glu Ile Asn Val Val Met Asn Ala His Leu Pro Phe AlaVal Val 225 230 235 240 Gly Ser Thr Glu Glu Val Lys Val Gly Asn Lys LeuVal Arg Ala Arg 245 250 255 Gln Tyr Pro Trp Gly Val Val Gln Val Glu AsnGlu Asn His Cys Asp 260 265 270 Phe Val Lys Leu Arg Glu Met Leu Ile ArgVal Asn Met Glu Asp Leu 275 280 285 Arg Glu Gln Thr His Ser Arg His TyrGlu Leu Tyr Arg Arg Cys Lys 290 295 300 Leu Glu Glu Met Gly Phe Gln AspSer Asp Gly Asp Ser Gln Pro Phe 305 310 315 320 Arg 645 amino acidsamino acid single linear protein NO NO G18.pep 144 Met Ser Arg Ile GluLys Met Ser Ile Leu Gly Val Arg Ser Phe Gly 1 5 10 15 Ile Glu Asp LysAsp Lys Gln Ile Ile Thr Phe Phe Ser Pro Leu Thr 20 25 30 Ile Leu Val GlyPro Asn Gly Ala Gly Lys Thr Thr Ile Ile Glu Cys 35 40 45 Leu Lys Tyr IleCys Thr Gly Asp Phe Pro Pro Gly Thr Lys Gly Asn 50 55 60 Thr Phe Val AsnAsp Pro Lys Val Ala Gln Glu Thr Asp Val Arg Ala 65 70 75 80 Gln Ile ArgLeu Gln Phe Arg Asp Val Asn Gly Glu Leu Ile Ala Val 85 90 95 Gln Arg SerMet Val Cys Thr Gln Lys Ser Lys Lys Thr Glu Phe Lys 100 105 110 Thr LeuGlu Gly Val Ile Thr Arg Thr Lys His Gly Glu Lys Val Ser 115 120 125 LeuSer Ser Lys Cys Ala Glu Ile Asp Arg Glu Met Ile Ser Ser Leu 130 135 140Gly Val Ser Lys Ala Val Leu Asn Asn Val Ile Phe Cys His Gln Glu 145 150155 160 Asp Ser Asn Trp Pro Leu Ser Glu Gly Lys Ala Leu Lys Gln Lys Phe165 170 175 Asp Glu Ile Phe Ser Ala Thr Arg Tyr Ile Lys Ala Leu Glu ThrLeu 180 185 190 Arg Gln Val Arg Gln Thr Gln Gly Gln Lys Val Lys Glu TyrGln Met 195 200 205 Glu Leu Lys Tyr Leu Lys Gln Tyr Lys Glu Lys Ala CysGlu Ile Arg 210 215 220 Asp Gln Ile Thr Ser Lys Glu Ala Gln Leu Thr SerSer Lys Glu Ile 225 230 235 240 Val Lys Ser Tyr Glu Asn Glu Leu Asp ProLeu Lys Asn Arg Leu Lys 245 250 255 Glu Ile Glu His Asn Leu Ser Lys IleMet Lys Leu Asp Asn Glu Ile 260 265 270 Lys Ala Leu Asp Ser Arg Lys LysGln Met Glu Lys Asp Asn Ser Glu 275 280 285 Leu Glu Glu Lys Met Glu LysVal Phe Gln Gly Thr Asp Glu Gln Leu 290 295 300 Asn Asp Leu Tyr His AsnHis Gln Arg Thr Val Arg Glu Lys Glu Arg 305 310 315 320 Lys Leu Val AspCys His Arg Glu Leu Glu Lys Leu Asn Lys Glu Ser 325 330 335 Arg Leu LeuAsn Gln Glu Lys Ser Glu Leu Leu Val Glu Gln Gly Arg 340 345 350 Leu GlnLeu Gln Ala Asp Arg His Gln Glu His Ile Arg Ala Arg Asp 355 360 365 SerLeu Ile Gln Ser Leu Ala Thr Gln Leu Glu Leu Asp Gly Phe Glu 370 375 380Arg Gly Pro Phe Ser Glu Arg Gln Ile Lys Asn Phe His Lys Leu Val 385 390395 400 Arg Glu Arg Gln Glu Gly Glu Ala Lys Thr Ala Asn Gln Leu Met Asn405 410 415 Asp Phe Ala Glu Lys Glu Thr Leu Lys Gln Lys Gln Ile Asp GluIle 420 425 430 Arg Asp Lys Lys Thr Gly Leu Gly Arg Ile Ile Glu Leu LysSer Glu 435 440 445 Ile Leu Ser Lys Lys Gln Asn Glu Leu Lys Asn Val LysTyr Glu Leu 450 455 460 Gln Gln Leu Glu Gly Ser Ser Asp Arg Ile Leu GluLeu Asp Gln Glu 465 470 475 480 Leu Ile Lys Ala Glu Arg Glu Leu Ser LysAla Glu Lys Asn Ser Asn 485 490 495 Val Glu Thr Leu Lys Met Glu Val IleSer Leu Gln Asn Glu Lys Ala 500 505 510 Asp Leu Asp Arg Thr Leu Arg LysLeu Asp Gln Glu Met Glu Gln Leu 515 520 525 Asn His His Thr Thr Thr ArgThr Gln Met Glu Met Leu Thr Lys Asp 530 535 540 Lys Ala Asp Lys Asp GluGln Ile Arg Lys Ile Lys Ser Arg His Ser 545 550 555 560 Asp Glu Leu ThrSer Leu Leu Gly Tyr Phe Pro Asn Lys Lys Gln Leu 565 570 575 Glu Asp TrpLeu His Ser Lys Ser Lys Glu Ile Asn Gln Thr Arg Asp 580 585 590 Arg LeuAla Lys Leu Lys Ile Val Leu Val Lys Pro His Ser Ile Thr 595 600 605 SerCys Tyr Asn Ser Leu Lys Lys Cys Phe Arg Asn Leu Ile Pro Leu 610 615 620Ile Glu Glu Val Leu Leu Leu Ser Ala Ala Asp Leu Glu Ala Val Ile 625 630635 640 Trp Ala Ser Ile Lys 645 527 amino acids amino acid single linearprotein NO NO G65.pep 145 Met Lys Leu Trp Val Ser Ala Leu Leu Met AlaTrp Phe Gly Val Leu 1 5 10 15 Ser Cys Val Gln Ala Glu Phe Phe Thr SerIle Gly His Met Thr Asp 20 25 30 Leu Ile Tyr Ala Glu Lys Glu Leu Val GlnSer Leu Lys Glu Tyr Ile 35 40 45 Leu Val Glu Glu Ala Lys Leu Ser Lys IleLys Ser Trp Ala Asn Lys 50 55 60 Met Glu Ala Leu Thr Ser Lys Ser Ala AlaAsp Ala Glu Gly Tyr Leu 65 70 75 80 Ala His Pro Val Asn Ala Tyr Lys LeuVal Lys Arg Leu Asn Thr Asp 85 90 95 Trp Pro Ala Leu Glu Asp Leu Val LeuGln Asp Ser Ala Ala Gly Phe 100 105 110 Ile Ala Asn Leu Ser Val Gln ArgGln Phe Phe Pro Thr Asp Glu Asp 115 120 125 Glu Ile Gly Ala Ala Lys AlaLeu Met Arg Leu Gln Asp Thr Tyr Arg 130 135 140 Leu Asp Pro Gly Thr IleSer Arg Gly Glu Leu Pro Gly Thr Lys Tyr 145 150 155 160 Gln Ala Met LeuSer Val Asp Asp Cys Phe Gly Met Gly Arg Ser Ala 165 170 175 Tyr Asn GluGly Asp Tyr Tyr His Thr Val Leu Trp Met Glu Gln Val 180 185 190 Leu LysGln Leu Asp Ala Gly Glu Glu Ala Thr Thr Thr Lys Ser Gln 195 200 205 ValLeu Asp Tyr Leu Ser Tyr Ala Val Phe Gln Leu Gly Asp Leu His 210 215 220Arg Ala Leu Glu Leu Thr Arg Arg Leu Leu Ser Leu Asp Pro Ser His 225 230235 240 Glu Arg Ala Gly Gly Asn Leu Arg Tyr Phe Glu Gln Leu Leu Glu Glu245 250 255 Glu Arg Glu Lys Thr Leu Thr Asn Gln Thr Glu Ala Glu Leu AlaThr 260 265 270 Pro Glu Gly Ile Tyr Glu Arg Pro Val Asp Tyr Leu Pro GluArg Asp 275 280 285 Val Tyr Glu Ser Leu Cys Arg Gly Glu Gly Val Lys LeuThr Pro Arg 290 295 300 Arg Gln Lys Arg Leu Phe Cys Arg Tyr His His GlyAsn Arg Ala Pro 305 310 315 320 Gln Leu Leu Ile Ala Pro Phe Lys Glu GluAsp Glu Trp Asp Ser Pro 325 330 335 His Ile Val Arg Tyr Tyr Asp Val MetSer Asp Glu Glu Ile Glu Arg 340 345 350 Ile Lys Glu Ile Ala Lys Pro LysLeu Ala Arg Ala Thr Val Arg Asp 355 360 365 Pro Lys Thr Gly Val Leu ThrVal Ala Ser Tyr Arg Val Ser Lys Ser 370 375 380 Ser Trp Leu Glu Glu AspAsp Asp Pro Val Val Ala Arg Val Asn Arg 385 390 395 400 Arg Met Gln HisIle Thr Gly Leu Thr Val Lys Thr Ala Glu Leu Leu 405 410 415 Gln Val AlaAsn Tyr Gly Val Gly Gly Gln Tyr Glu Pro His Phe Asp 420 425 430 Phe SerArg Arg Pro Phe Asp Ser Gly Leu Pro Thr Leu Gly Gln Arg 435 440 445 GlyIle Val Leu Ala Thr Phe Leu Asn Tyr Met Ser Asp Val Glu Ala 450 455 460Gly Gly Ala Thr Val Phe Pro Asp Leu Gly Ala Ala Ile Trp Pro Lys 465 470475 480 Lys Gly Thr Lys Leu Cys Ser Gly Thr Thr Ser Cys Gly Ala Gly Lys485 490 495 Val Thr Thr Glu Gln Asp Met Leu Pro Ala Leu Cys Leu Trp AlaAla 500 505 510 Ser Gly Ser Pro Ile Ser Gly Ser Met Asn Glu Asp Arg SerSer 515 520 525 727 base pairs nucleic acid double linear cDNA to mRNANO NO TcA - N-terminal fragment 146 GTTGGTGTCA GCGGGCAACA GCCCACAGGAGTGTGCACCT CCTAGGACAG AGTTTGTCCT 60 CTCACTTCTG GAGAAGATGC AGACACAGGAGATCCTGAGG ATACTGCGAC TGCCTGAGCT 120 AGGTGACTTG GGACAGTTTT TCCGCAGCCTCTCGGCCACC ACCCTCGTGA GTATGGGTGC 180 CCTGGCTGCC ATCCTTGCCT ACTGGTTCACTCACCGGCCA AAGGCCTTGC AGCCGCCATG 240 CAACCTCCTG ATGCAGTCAG AAGAAGTAGAGGACAGTGGC GGGGCACGGC GATCTGTGAT 300 TGGGTCTGGC CCTCAGCTAC TTACCCACTACTATGATGAT GCCCGGACCA TGTACCAGGT 360 GTTCCGCCGT GGGCTTAGCA TCTCAGGGAATGGGCCCTGT CTTGGTTTCA GGAAGCCTAA 420 GCAGCCTTAC CAGTGGCTGT CCTACCAGGAGGTGGCCGAC AGGGCTGAAT TTCTGGGGTC 480 CGGACTTCTC CAGCACAATT GTAAAGCATGCACTGATCAG TTTATTGGTG TTTTTGCACA 540 AAATCGGCCA GAGTGGATCA TTGTGGAGCTGGCCTGCTAC ACATATTCCA TGGTGGTGGT 600 CCCGCTCTAT GACACCCTGG GCCCTGGGGCTATCCGCTAC ATCATCAATA CAGCGGACAT 660 CAGCACCGTG ATTGTGGACA AACCTCAGAAGGCTGTGCTT CTGCTAGAGC ATGTGGAGAG 720 GAAGGAG 727 874 base pairs nucleicacid double linear cDNA to mRNA NO NO TcA - C-terminal fragment 147GTGTGAGAGG ACCAAATGTG TTCAAAGGCT ACTTGAAAGA TCCAGACAGG ACGAAGGAGG 60CCCTGGACAG CGATGGCTGG CTTCACACTG GAGACATCGG AAAATGGCTG CCGGCAGGAA 120CTCTTAAAAT TATTGATCGG AAAAAGCATA TATTTAAACT TGCTCAGGGA GAATATGTTG 180CACCCGAGAA GATTGAGAAC ATCTACATCC GGAGCCAACC TGTGGCGCAA ATCTATGTCC 240ATGGGGACAG CTTAAAGGCC TTTTTGGTAG GCATTGTTGT GCCTGACCCT GAAGTTATGC 300CCTCCTGGGC CCAGAAGAGA GGAATTGAAG GAACATATGC AGATCTCTGC ACAAATAAGG 360ATCTGAAGAA AGCCATTTTG GAAGATATGG TGAGGTTAGG AAAAGAAAGT GGACTCCATT 420CTTTTGAGCA GGTTAAAGCC ATTCACATCC ATTCTGACAT GTTCTCAGTT CAAAATGGCT 480TGCTGACACC AACACTAAAA GCTAAGAGAC CTGAGCTGAG AGAGTACTTC AAAAAACAAA 540TAGAAGAGCT TTACTCAATC TCCATGTGAA GTTCAAGGAA AGTTCTTCTC AGTGTAATGA 600ACTGTCTAGC AATATTATAG TTATTCTTGA AAGTAATGAG TCAAAATGAC ACAGCTGAAA 660ATGAATAAGC ATCTGATTTT ATGACTGAGC CTTTTCCTGT CCCAAGAGGT CTTTAACAAT 720ATTTTCTCTA TCATCAATGA GTATATTTTA TTTTTATTAT AAAAATGATA TTGTGGTGGA 780CTGCTAAAAA TATCACAAAT GGCAATGTAA AAATCAAGAC ATTTTCTCAA GAACTGTGTA 840CCACTAAAAG TAATATATTG TCAATGTTCA CAGG 874 1312 amino acids amino acidlinear protein NO Rad50.pro-translation of SEQ ID NO54 148 Met Ser ArgIle Glu Lys Met Ser Ile Leu Gly Val Arg Ser Phe Gly 1 5 10 15 Ile GluAsp Lys Asp Lys Gln Ile Ile Thr Phe Phe Ser Pro Leu Thr 20 25 30 Ile LeuVal Gly Pro Asn Gly Ala Gly Lys Thr Thr Ile Ile Glu Cys 35 40 45 Leu LysTyr Ile Cys Thr Gly Asp Phe Pro Pro Gly Thr Lys Gly Asn 50 55 60 Thr PheVal His Asp Pro Lys Val Ala Gln Glu Thr Asp Val Arg Ala 65 70 75 80 GlnIle Arg Leu Gln Phe Arg Asp Val Asn Gly Glu Leu Ile Ala Val 85 90 95 GlnArg Ser Met Val Cys Thr Gln Lys Ser Lys Lys Thr Glu Phe Lys 100 105 110Thr Leu Glu Gly Val Ile Thr Arg Thr Lys His Gly Glu Lys Val Ser 115 120125 Leu Ser Ser Lys Cys Ala Glu Ile Asp Arg Glu Met Ile Ser Ser Leu 130135 140 Gly Val Ser Lys Ala Val Leu Asn Asn Val Ile Phe Cys His Gln Glu145 150 155 160 Asp Ser Asn Trp Pro Leu Ser Glu Gly Lys Ala Leu Lys GlnLys Phe 165 170 175 Asp Glu Ile Phe Ser Ala Thr Arg Tyr Ile Lys Ala LeuGlu Thr Leu 180 185 190 Arg Gln Val Arg Gln Thr Gln Gly Gln Lys Val LysGlu Tyr Gln Met 195 200 205 Glu Leu Lys Tyr Leu Lys Gln Tyr Lys Glu LysAla Cys Glu Ile Arg 210 215 220 Asp Gln Ile Thr Ser Lys Glu Ala Gln LeuThr Ser Ser Lys Glu Ile 225 230 235 240 Val Lys Ser Tyr Glu Asn Glu LeuAsp Pro Leu Lys Asn Arg Leu Lys 245 250 255 Glu Ile Glu His Asn Leu SerLys Ile Met Lys Leu Asp Asn Glu Ile 260 265 270 Lys Ala Leu Asp Ser ArgLys Lys Gln Met Glu Lys Asp Asn Ser Glu 275 280 285 Leu Glu Glu Lys MetGlu Lys Val Phe Gln Gly Thr Asp Glu Gln Leu 290 295 300 Asn Asp Leu TyrHis Asn His Gln Arg Thr Val Arg Glu Lys Glu Arg 305 310 315 320 Lys LeuVal Asp Cys His Arg Glu Leu Glu Lys Leu Asn Lys Glu Ser 325 330 335 ArgLeu Leu Asn Gln Glu Lys Ser Glu Leu Leu Val Glu Gln Gly Arg 340 345 350Leu Gln Leu Gln Ala Asp Arg His Gln Glu His Ile Arg Ala Arg Asp 355 360365 Ser Leu Ile Gln Ser Leu Ala Thr Gln Leu Glu Leu Asp Gly Phe Glu 370375 380 Arg Gly Pro Phe Ser Glu Arg Gln Ile Lys Asn Phe His Lys Leu Val385 390 395 400 Arg Glu Arg Gln Glu Gly Glu Ala Lys Thr Ala Asn Gln LeuMet Asn 405 410 415 Asp Phe Ala Glu Lys Glu Thr Leu Lys Gln Lys Gln IleAsp Glu Ile 420 425 430 Arg Asp Lys Lys Thr Gly Leu Gly Arg Ile Ile GluLeu Lys Ser Glu 435 440 445 Ile Leu Ser Lys Lys Gln Asn Glu Leu Lys AsnVal Lys Tyr Glu Leu 450 455 460 Gln Gln Leu Glu Gly Ser Ser Asp Arg IleLeu Glu Leu Asp Gln Glu 465 470 475 480 Leu Ile Lys Ala Glu Arg Glu LeuSer Lys Ala Glu Lys Asn Ser Asn 485 490 495 Val Glu Thr Leu Lys Met GluVal Ile Ser Leu Gln Asn Glu Lys Ala 500 505 510 Asp Leu Asp Arg Thr LeuArg Lys Leu Asp Gln Glu Met Glu Gln Leu 515 520 525 Asn His His Thr ThrThr Arg Thr Gln Met Glu Met Leu Thr Lys Asp 530 535 540 Lys Ala Asp LysAsp Glu Gln Ile Arg Lys Ile Lys Ser Arg His Ser 545 550 555 560 Asp GluLeu Thr Ser Leu Leu Gly Tyr Phe Pro Asn Lys Lys Gln Leu 565 570 575 GluAsp Trp Leu His Ser Lys Ser Lys Glu Ile Asn Gln Thr Arg Asp 580 585 590Arg Leu Ala Lys Leu Asn Lys Glu Leu Ala Ser Ser Glu Gln Asn Lys 595 600605 Asn His Ile Asn Asn Glu Leu Lys Arg Arg Glu Glu Gln Leu Ser Ser 610615 620 Tyr Glu Asp Lys Leu Phe Asp Val Cys Gly Ser Gln Asp Phe Glu Ser625 630 635 640 Asp Leu Asp Arg Leu Lys Glu Glu Ile Glu Lys Ser Ser LysGln Arg 645 650 655 Ala Met Leu Ala Gly Ala Thr Ala Val Tyr Ser Gln PheIle Thr Gln 660 665 670 Leu Thr Asp Glu Asn Gln Ser Cys Cys Pro Val CysGln Arg Val Phe 675 680 685 Gln Thr Glu Ala Glu Leu Gln Glu Val Ile SerAsp Leu Gln Ser Lys 690 695 700 Leu Arg Leu Ala Pro Asp Lys Leu Lys SerThr Glu Ser Glu Leu Lys 705 710 715 720 Lys Lys Glu Lys Arg Arg Asp GluMet Leu Gly Leu Val Pro Met Arg 725 730 735 Gln Ser Ile Ile Asp Leu LysGlu Lys Glu Ile Pro Glu Leu Arg Asn 740 745 750 Lys Leu Gln Asn Val AsnArg Asp Ile Gln Arg Leu Lys Asn Asp Ile 755 760 765 Glu Glu Gln Glu ThrLeu Leu Gly Thr Ile Met Pro Glu Glu Glu Ser 770 775 780 Ala Lys Val CysLeu Thr Asp Val Thr Ile Met Glu Arg Phe Gln Met 785 790 795 800 Glu LeuLys Asp Val Glu Arg Lys Ile Ala Gln Gln Ala Ala Lys Leu 805 810 815 GlnGly Ile Asp Leu Asp Arg Thr Val Gln Gln Val Asn Gln Glu Lys 820 825 830Gln Glu Lys Gln His Lys Leu Asp Thr Val Ser Ser Lys Ile Glu Leu 835 840845 Asn Arg Lys Leu Ile Gln Asp Gln Gln Glu Gln Ile Gln His Leu Lys 850855 860 Ser Thr Thr Asn Glu Leu Lys Ser Glu Lys Leu Gln Ile Ser Thr Asn865 870 875 880 Leu Gln Arg Arg Gln Gln Leu Glu Glu Gln Thr Val Glu LeuSer Thr 885 890 895 Glu Val Gln Ser Leu Tyr Arg Glu Ile Lys Asp Ala LysGlu Gln Val 900 905 910 Ser Pro Leu Glu Thr Thr Leu Glu Lys Phe Gln GlnGlu Lys Glu Glu 915 920 925 Leu Ile Asn Lys Lys Asn Thr Ser Asn Lys IleAla Gln Asp Lys Leu 930 935 940 Asn Asp Ile Lys Glu Lys Val Lys Asn IleHis Gly Tyr Met Lys Asp 945 950 955 960 Ile Glu Asn Tyr Ile Gln Asp GlyLys Asp Asp Tyr Lys Lys Gln Lys 965 970 975 Glu Thr Glu Leu Asn Lys ValIle Ala Gln Leu Ser Glu Cys Glu Lys 980 985 990 His Lys Glu Lys Ile AsnGlu Asp Met Arg Leu Met Arg Gln Asp Ile 995 1000 1005 Asp Thr Gln LysIle Gln Glu Arg Trp Leu Gln Asp Asn Leu Thr Leu 1010 1015 1020 Arg LysArg Asn Glu Glu Leu Lys Glu Val Glu Glu Glu Arg Lys Gln 1025 1030 10351040 His Leu Lys Glu Met Gly Gln Met Gln Val Leu Gln Met Lys Ser Glu1045 1050 1055 His Gln Lys Leu Glu Glu Asn Ile Asp Asn Ile Lys Arg AsnHis Asn 1060 1065 1070 Leu Ala Leu Gly Arg Gln Lys Gly Tyr Glu Glu GluIle Ile His Phe 1075 1080 1085 Lys Lys Glu Leu Arg Glu Pro Gln Phe ArgAsp Ala Glu Glu Lys Tyr 1090 1095 1100 Arg Glu Met Met Ile Val Met ArgThr Thr Glu Leu Val Asn Lys Asp 1105 1110 1115 1120 Leu Asp Ile Tyr TyrLys Thr Leu Asp Gln Ala Ile Met Lys Phe His 1125 1130 1135 Ser Met LysMet Glu Glu Ile Asn Lys Ile Ile Arg Asp Leu Trp Arg 1140 1145 1150 SerThr Tyr Arg Gly Gln Asp Ile Glu Tyr Ile Glu Ile Arg Ser Asp 1155 11601165 Ala Asp Glu Asn Val Ser Ala Ser Asp Lys Arg Arg Asn Tyr Asn Tyr1170 1175 1180 Arg Val Val Met Leu Lys Gly Asp Thr Ala Leu Asp Met ArgGly Arg 1185 1190 1195 1200 Cys Ser Ala Gly Gln Lys Val Leu Ala Ser LeuIle Ile Arg Leu Ala 1205 1210 1215 Leu Ala Glu Thr Phe Cys Leu Asn CysGly Ile Ile Ala Leu Asp Glu 1220 1225 1230 Pro Thr Thr Asn Leu Asp ArgGlu Asn Ile Glu Ser Leu Ala His Ala 1235 1240 1245 Leu Val Glu Ile IleLys Ser Arg Ser Gln Gln Arg Asn Phe Gln Leu 1250 1255 1260 Leu Val IleThr His Asp Glu Asp Phe Val Glu Leu Leu Gly Arg Ser 1265 1270 1275 1280Glu Tyr Val Glu Lys Phe Tyr Arg Ile Lys Lys Asn Ile Asp Gln Cys 12851290 1295 Ser Glu Ile Val Lys Cys Ser Val Ser Ser Leu Gly Phe Asn ValHis 1300 1305 1310 22 base pairs nucleic acid single linear DNA NO NOPrimer A116-2 149 GGACCAGTAC TTCCTGAGCT TG 22 22 base pairs nucleic acidsingle linear DNA (genomic) NO NO A116-1 150 TTGGTGCTGA ATACCAGCCC TG 22650 base pairs nucleic acid unknown linear cDNA to mRNA NO NOa94g6ds-116f.seq 151 GCCACTCACA CAGCATCTCC AAGATCAGGG ACCAGTACTTCCTGAGCTTG ACAGAGAATG 60 AATGTGTCAG ACTGACCTCT GCCCATTTTG TAGTTTTCTCATCATTTTCT CACTCAGTCT 120 TCCCTTTTCA AGGGCCCACA CTCTTCCCGA GGGCTGGGCCTAGTGAGCGG GGTCACAGTA 180 CATATGGTTT CTGGGACTGA GAAGGTGGAA GATGTGTCCATAGAGCTTTT GTTTCCTAAG 240 CAACGTATTA CTGCCATGAT TCCATTCCCT AGATGATGCTGGTGATGCAA GCTGGCTTCT 300 CTTGGCCAGC CTACCCTACT GCTGGGTAGT GTTTATGCCCCATGGCCAGA CACTGAAGAG 360 GGAGACAGGA AAAGCACATA TCCACACCTT CCACCCTCAGACATTCCTGT AACTTGAGCT 420 TATCTAAGGG GGCATTGTCA TATGTCAGGG GTTCCCAAACTACGGTCTTC AGAAACACTG 480 TTTACCCTCC ATAGAGGTTG TGTGCATCAG CCCAGGCAGAATCCTGCTTC ATGAAGGTGT 540 TTTCCTAATG CATGTGTGCA TGGACCTGTC TCATGCTACACTGCAGGGCT GGTATTCAGC 600 ACCAATAGTT ATTGTTGGCT GCTAAAATAG CAAACTAGCCAAAATGGCAG 650

It is claimed:
 1. A substantially-isolated polynucleotide having asequence encoding a human polypeptide having immunomodulatory activity,where said polynucleotide contains a sequence selected from the groupconsisting of SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68,SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74,SEQ ID NO:76, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:82, SEQ ID NO:83,SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:88, SEQ ID NO:92, SEQ ID NO:95,SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:104, SEQ ID NO:105,SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ IDNO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:124, SEQID NO:130, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,SEQ ID NO:136, SEQ ID NO:137 and SEQ ID NO:138.
 2. A polynucleotide ofclaim 1 containing a sequence selected from the group represented by SEQID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:70, SEQ IDNO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:76, SEQ IDNO:85, SEQ ID NO:86, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:99, SEQ IDNO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:124, SEQ ID NO:130, SEQID NO:135, SEQ ID NO:136, SEQ ID NO:137 and SEQ ID NO:138.
 3. Apolynucleotide of claim 2 containing a sequence selected from the grouprepresented by SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:70,SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75,SEQ ID N0:76, SEQ ID NO:78 and SEQ ID NO:79.
 4. A substantially-isolatedhuman polypeptide having immunomodulatory activity, where saidpolypeptide has a sequence encoded by a polynucleotide selected from thegroup consisting of SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ IDNO:68, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ IDNO:74, SEQ ID NO:76, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:82, SEQ IDNO:83, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:88, SEQ ID NO:92, SEQ IDNO:95, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:104, SEQ IDNO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:124,SEQ ID NO:130, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ IDNO:135, SEQ ID NO:136, SEQ ID NO:137 and SEQ ID NO:138.
 5. A polypeptideof claim 4, where said polypeptide has a sequence encoded by apolynucleotide selected from the group consisting of SEQ ID NO:65, SEQID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:71, SEQ IDNO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:76, SEQ ID NO:85, SEQ IDNO:86, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:99, SEQ ID NO:105, SEQ IDNO:106, SEQ ID NO:107, SEQ ID NO:124, SEQ ID NO:130, SEQ ID NO:135, SEQID NO:136, SEQ ID NO:137 and SEQ ID NO:138.
 6. A polypeptide of claim 4,where said polypeptide has a sequence encoded by a polynucleotideselected from the group consisting of SEQ ID NO:65, SEQ ID NO:66, SEQ IDNO:67, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ IDNO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:78 and SEQ ID NO:79.
 7. Asubstantially-isolated polynucleotide having a sequence encoding a humanhomologue of yeast RAD50.
 8. A polynucleotide of claim 7 containing asequence selected from the group consisting of SEQ ID NO:54 and SEQ IDNO:55.
 9. A substantially-isolated human homolog of yeast RAD50polypeptide.
 10. A polypeptide of claim 9 containing a polypeptidesequence encoded by a polynucleotide sequence selected from the groupconsisting of SEQ ID NO:54 and SEQ ID NO:55.
 11. Asubstantially-isolated polynucleotide having a sequence encoding a humanhomologue of Drosophila melanogaster Septin-2.
 12. A polynucleotide ofclaim 11 containing a sequence represented by SEQ ID NO:97.
 13. Asubstantially-isolated human Septin-2 homolog polypeptide.
 14. Apolypeptide of claim 13 containing a polypeptide sequence encoded by thepolynucleotide sequence represented by SEQ ID NO:97.
 15. A method ofidentifying the presence of activated T-cells in a sample containing aplurality of different cell types, comprising performing a polymerasechain reaction amplification, where an aliquot of said sample serves asan amplification target and where said amplification is done using anoligonucleotide primer pair capable of selective amplification of apolynucleotide fragment having the sequence represented as SEQ IDNO:151, to generate an amplification product having a specific size, anddetermining the size of the amplification product, wherein presence ofamplification product of an expected size is indicative of the presenceof activated T cells in the sample.
 16. A method of claim 15, where saidprimer pair consists of primers having sequences represented as SEQ IDNO:149 and SEQ ID NO:150.
 17. A method of claim 15, where said sample isfrom adult tissue.